| CPC G06F 40/284 (2020.01) [G06N 3/045 (2023.01); G06N 3/08 (2013.01)] | 20 Claims |

|
1. A system for attentive sentence similarity scoring, the system comprising:
a computer-readable medium storing instructions that are operative upon execution by a processor to:
precompute a plurality of candidate sentences using a trained distilled sentence embedding (DSE) language model, wherein the trained DSE language model generates a plurality of candidate sentence representations based on the plurality of candidate sentences;
receive an input query including a selected sentence;
generate a set of similarity scores for each candidate sentence in the plurality of candidate sentences and the selected sentence by comparing a representation of the selected sentence to each candidate sentence representation in the plurality of candidate sentence representations, wherein the trained DSE language model is trained using training pairs of sentences and a set of teacher similarity scores obtained from a transformer language model, and wherein the trained DSE language model is trained to approximate the set of teacher similarity scores for the training pairs of sentences using a low-cost similarity function;
select a set of similar sentences from the plurality of candidate sentences based on the set of similarity scores; and
generate a response to the input query comprising the set of similar sentences.
|
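
For illustration only, the retrieval steps recited in claim 1 (precomputing candidate representations, scoring a selected sentence against them with a low-cost similarity function, and selecting similar sentences) might be sketched as follows. This is a minimal sketch under stated assumptions, not the claimed implementation: `encode` is a hypothetical bag-of-words stand-in for the trained DSE language model, cosine similarity is assumed as the low-cost similarity function, and the names `build_index`, `query`, and the parameter `k` are illustrative. The training step, in which the DSE model learns to approximate teacher similarity scores from a transformer language model, is not shown.

```python
import math
from collections import Counter


def encode(sentence):
    """Hypothetical stand-in for the trained DSE model: a bag-of-words vector."""
    return Counter(sentence.lower().split())


def cosine(u, v):
    """Assumed low-cost similarity function: cosine similarity of sparse vectors."""
    dot = sum(u[t] * v[t] for t in u if t in v)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def build_index(candidates):
    # Precompute one representation per candidate sentence, ahead of any query.
    return [(s, encode(s)) for s in candidates]


def query(index, selected_sentence, k=2):
    # Only the selected sentence is encoded at query time; candidates are reused.
    q = encode(selected_sentence)
    scored = [(cosine(q, rep), s) for s, rep in index]
    # Stable sort: candidates with equal scores keep their original order.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [s for _, s in scored[:k]]


candidates = [
    "the cat sat on the mat",
    "a dog barked at the mailman",
    "the cat slept on the sofa",
]
index = build_index(candidates)
similar = query(index, "the cat sat on the sofa", k=2)
```

Because the candidate representations are computed once in `build_index`, each query costs one encoding plus one inexpensive comparison per candidate, rather than one full transformer pass per sentence pair.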