US 11,875,115 B2
	Learned evaluation model for grading quality of natural language generation outputs
Thibault Sellam, New York City, NY (US); Dipanjan Das, Jersey City, NJ (US); and Ankur Parikh, New York City, NY (US)
Assigned to Google LLC, Mountain View, CA (US)
Filed by GOOGLE LLC, Mountain View, CA (US)
Filed on Dec. 12, 2022, as Appl. No. 18/079,148.
Application 18/079,148 is a continuation of application No. 17/003,572, filed on Aug. 26, 2020, granted, now 11,551,002.
Prior Publication US 2023/0110829 A1, Apr. 13, 2023
This patent is subject to a terminal disclaimer.
Int. Cl. G06F 40/289 (2020.01); G06F 40/205 (2020.01); G06F 40/47 (2020.01); G06F 40/51 (2020.01)

CPC G06F 40/289 (2020.01) [G06F 40/205 (2020.01); G06F 40/47 (2020.01); G06F 40/51 (2020.01)]

22 Claims

1. A method of training a neural network, comprising:

generating, by one or more processors of a processing system, for each given synthetic sentence pair of a plurality of synthetic sentence pairs, each given synthetic sentence pair comprising an original passage of text and a modified passage of text:

a first training signal of a plurality of training signals based on whether the given synthetic sentence pair was generated using backtranslation; and

one or more second training signals of the plurality of training signals based on a prediction from a textual entailment model regarding a likelihood that the modified passage of text of the given synthetic sentence pair entails or contradicts the original passage of text of the given synthetic sentence pair;

pretraining, by the one or more processors, the neural network to predict, for each given synthetic sentence pair of the plurality of synthetic sentence pairs, the plurality of training signals for the given synthetic sentence pair; and

fine-tuning, by the one or more processors, the neural network to predict, for each given human-graded sentence pair of a plurality of human-graded sentence pairs, a grade allocated by a human grader to the given human-graded sentence pair.