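I want to train a phrase similarity model for my application. The model should predict a similarity score for pairs of phrases like the ones below, where each pair should be treated as a match. For example: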
International Business Machines = I.B.M
Synergy Telecom = SynTel
Beam inc = Beam Incorporate
Sir J J Smith = Johnson Smith
Alex, Julia = J Alex
James B. D. Joshi = James Joshi
James Beaty, Jr. = Beaty
Is there any dataset available for training this type of model?
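
To make the goal concrete, here is a minimal sketch of the input and output shape I have in mind: a pair of phrases goes in, and a score between 0 and 1 comes out. It uses `difflib.SequenceMatcher` from Python's standard library as a naive character-level baseline. That choice is only an assumption for illustration and is not the trained model I am looking for. The names `PAIRS` and `baseline_similarity` are hypothetical.

```python
from difflib import SequenceMatcher

# Example pairs from the question; each one should count as "similar".
PAIRS = [
    ("International Business Machines", "I.B.M"),
    ("Synergy Telecom", "SynTel"),
    ("Beam inc", "Beam Incorporate"),
    ("Sir J J Smith", "Johnson Smith"),
    ("Alex, Julia", "J Alex"),
    ("James B. D. Joshi", "James Joshi"),
    ("James Beaty, Jr.", "Beaty"),
]


def baseline_similarity(a: str, b: str) -> float:
    """Return a character-level similarity in [0, 1].

    This is a naive stand-in (an assumption for illustration),
    not a trained phrase-similarity model.
    """
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


if __name__ == "__main__":
    # Print the baseline score for every example pair.
    for left, right in PAIRS:
        score = baseline_similarity(left, right)
        print(f"{left!r} vs {right!r}: {score:.2f}")
```

A purely character-based score like this is likely to rate abbreviation pairs such as the first one poorly, which is why I am looking for labeled data to train a proper model.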