US 12,223,279 B2
Method for generating cross-lingual textual semantic model, and electronic device
Yaqian Han, Beijing (CN); Shuohuan Wang, Beijing (CN); and Yu Sun, Beijing (CN)
Assigned to BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD., Beijing (CN)
Filed by BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD., Beijing (CN)
Filed on Nov. 11, 2022, as Appl. No. 18/054,608.
Claims priority of application No. 202111647494.9 (CN), filed on Dec. 29, 2021.
Prior Publication US 2023/0080904 A1, Mar. 16, 2023
Int. Cl. G06F 40/30 (2020.01); G06N 5/022 (2023.01)
CPC G06F 40/30 (2020.01) [G06N 5/022 (2013.01)]
15 Claims
 
1. A method for generating a cross-lingual textual semantic model, comprising:
acquiring a set of training data, wherein the set of training data comprises pieces of monolingual non-parallel text and pieces of bilingual parallel text;
determining a semantic vector of each piece of text in the set of training data by inputting each piece of text in the set of training data into an initial textual semantic model;
determining a distance between semantic vectors of each two pieces of text in the set of training data based on the semantic vector of each piece of text in the set of training data;
determining a gradient modification based on a parallel relationship between each two pieces of text in the set of training data and the distance between the semantic vectors of each two pieces of text in the set of training data; and
acquiring a modified textual semantic model by modifying the initial textual semantic model based on the gradient modification.
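The steps of claim 1 can be illustrated with a minimal sketch. Everything concrete in it is an assumption made for illustration and is not taken from the claim: the "initial textual semantic model" is a hypothetical linear map `W`, each piece of text is represented by a pre-computed feature vector, the distance is squared Euclidean, and the gradient modification comes from a simple margin-based contrastive loss that pulls parallel pairs together and pushes non-parallel pairs apart. The names `encode`, `sq_dist`, `train_step`, `margin`, and `lr` are likewise hypothetical.

```python
def encode(W, x):
    """Toy stand-in for the textual semantic model: semantic vector = W @ x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def sq_dist(u, v):
    """Squared Euclidean distance between two semantic vectors (assumed metric)."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def train_step(W, feats, parallel, margin=1.0, lr=0.05):
    """One model update. `parallel` holds index pairs (i, j), i < j, of parallel texts.

    Returns (modified_W, loss), where loss is evaluated before the update.
    """
    # Semantic vector of each piece of text in the training set.
    vecs = [encode(W, x) for x in feats]
    grad = [[0.0] * len(W[0]) for _ in W]
    loss = 0.0
    n = len(feats)
    for i in range(n):
        for j in range(i + 1, n):
            # Distance between the semantic vectors of each two pieces of text.
            d = sq_dist(vecs[i], vecs[j])
            if (i, j) in parallel:
                sign = 1.0            # parallel pair: minimize the distance
                loss += d
            elif d < margin:
                sign = -1.0           # non-parallel pair inside the margin: push apart
                loss += margin - d
            else:
                continue              # non-parallel and already far enough apart
            dv = [a - b for a, b in zip(vecs[i], vecs[j])]
            dx = [a - b for a, b in zip(feats[i], feats[j])]
            # Gradient of (+/-) ||W x_i - W x_j||^2 with respect to W.
            for r in range(len(W)):
                for c in range(len(W[0])):
                    grad[r][c] += sign * 2.0 * dv[r] * dx[c]
    # Modified model: apply the gradient modification to the initial model.
    new_W = [[w - lr * g for w, g in zip(w_row, g_row)]
             for w_row, g_row in zip(W, grad)]
    return new_W, loss


# Hypothetical data: texts 0 and 1 form a bilingual parallel pair,
# text 2 is monolingual non-parallel text. Features are one-hot placeholders.
feats = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
parallel = {(0, 1)}
W = [[0.5, -0.5, 0.1], [0.2, 0.3, -0.4]]

for _ in range(30):
    W, loss = train_step(W, feats, parallel)
```

In this toy setting, repeated updates draw the semantic vectors of the parallel pair together while keeping the non-parallel text at least roughly a margin away. A practical system would presumably use a neural encoder and automatic differentiation rather than the hand-derived gradient shown here.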