CPC G06F 40/166 (2020.01) [G06F 40/289 (2020.01); G06F 40/30 (2020.01); G06N 3/082 (2013.01)] | 19 Claims |
1. A method for language processing, comprising:
identifying a simplified text that includes original information from a complex text and additional information that is not in the complex text;
computing an entailment score for each sentence of the simplified text using a neural network, wherein the entailment score indicates whether the sentence of the simplified text includes information from a sentence of the complex text corresponding to the sentence of the simplified text;
generating a modified text based on the entailment score, the simplified text, and the complex text, wherein the modified text includes the original information and excludes the additional information;
computing a first similarity score based on the complex text and the simplified text;
computing a second similarity score based on the complex text and the modified text;
comparing the first similarity score to the second similarity score; and
providing the modified text based on the comparison.
|