US 12,314,341 B2
Method and apparatus for training embedding vector generation model
GyuBum Han, Suwon-si (KR); Jehun Jeon, Suwon-si (KR); and Inkyu Choi, Seoul (KR)
Assigned to Samsung Electronics Co., Ltd., Suwon-si (KR)
Filed by Samsung Electronics Co., Ltd., Suwon-si (KR)
Filed on Jan. 20, 2021, as Appl. No. 17/153,011.
Claims priority of application No. 10-2020-0104025 (KR), filed on Aug. 19, 2020.
Prior Publication US 2022/0058433 A1, Feb. 24, 2022
Int. Cl. G06F 18/20 (2023.01); G06F 16/33 (2025.01); G06F 18/214 (2023.01); G06F 18/22 (2023.01); G06F 40/295 (2020.01); G06F 40/30 (2020.01)
CPC G06F 18/214 (2023.01) [G06F 18/22 (2023.01); G06F 40/295 (2020.01); G06F 40/30 (2020.01)] 20 Claims
OG exemplary drawing
 
1. A processor-implemented method of training an embedding vector generation model, the method comprising:
identifying a keyword in a query sentence;
generating an embedding vector of the query sentence and an embedding vector of the keyword based on the embedding vector generation model; and
training the embedding vector generation model such that a first similarity between the embedding vector of the query sentence and the embedding vector of the keyword is greater than a second similarity between an embedding vector of a reference sentence and the embedding vector of the keyword,
wherein the embedding vector generation model is pre-trained based on any one or any combination of a general sentence and a conversational sentence, and
wherein the keyword comprises at least one word in the query sentence and a length of the keyword is less than a length of the general sentence or the conversational sentence.