US 12,204,587 B2
	Document similarity through reference links
Yonatan Ben-Simhon, Tel Aviv (IL); Rami Cohen, Ashkelon (IL); Oren Sar Shalom, Nes Ziona (IL); and Alexander Zhicharevich, Petah Tikva (IL)
Assigned to Intuit Inc., Mountain View, CA (US)
Filed by INTUIT INC., Mountain View, CA (US)
Filed on Oct. 29, 2019, as Appl. No. 16/667,189.
Prior Publication US 2021/0124783 A1, Apr. 29, 2021
Int. Cl. G06F 16/9032 (2019.01); G06F 17/18 (2006.01); G06F 18/22 (2023.01); G06N 3/04 (2023.01)

CPC G06F 16/90332 (2019.01) [G06F 17/18 (2013.01); G06F 18/22 (2023.01); G06N 3/04 (2013.01)]

20 Claims

1. A method, comprising:

receiving a set of question and answer (Q/A) pairs;

identifying a set of references in the set of Q/A pairs that link pairs of Q/A pairs of the set of Q/A pairs, wherein the set of references is based on at least an identifier of a first Q/A pair of the set of Q/A pairs being included in a second Q/A pair of the set of Q/A pairs;

identifying popular Q/A pairs of the set of Q/A pairs based on the set of references, wherein:

the popular Q/A pairs are referenced by a subset of the set of Q/A pairs;

identifiers of the popular Q/A pairs are included in specific Q/A pairs of the subset of the set of Q/A pairs; and

each respective Q/A pair of the subset of the set of Q/A pairs comprises a respective question of a plurality of questions; and

training a machine learning model through a supervised learning process to predict Q/A pairs of the set of Q/A pairs that are relevant to a given question, wherein the supervised learning process comprises providing a particular question of the plurality of questions to the machine learning model, receiving a prediction from the machine learning model as to whether a popular Q/A pair of the Q/A pairs is relevant to the particular question, and iteratively adjusting parameters of the machine learning model based on comparing the prediction to a label that is based on whether the set of references indicates that the popular Q/A pair is relevant to the particular question.