US 11,893,351 B2
Modified machine learning model and method for coherent key phrase extraction
Oren Sar Shalom, Nes Ziona (IL); and Yehezkel Shraga Resheff, Jerusalem (IL)
Assigned to Intuit Inc., Mountain View, CA (US)
Filed by Intuit Inc., Mountain View, CA (US)
Filed on Aug. 22, 2022, as Appl. No. 17/893,153.
Application 17/893,153 is a continuation of application No. 16/805,688, filed on Feb. 28, 2020, granted, now 11,436,413.
Prior Publication US 2022/0405476 A1, Dec. 22, 2022
This patent is subject to a terminal disclaimer.
Int. Cl. G06F 40/289 (2020.01); G06F 17/18 (2006.01); G06N 20/00 (2019.01)
CPC G06F 40/289 (2020.01) [G06F 17/18 (2013.01); G06N 20/00 (2019.01)] 18 Claims
OG exemplary drawing
 
1. A method comprising:
receiving, in a machine learning model, a corpus comprising a plurality of words comprising natural language terms, wherein the machine learning model comprises a plurality of layers configured to extract a plurality of keywords out of the corpus and further comprises a retrospective layer;
identifying, in the plurality of layers, a first keyword from the corpus and a second keyword from the corpus;
assigning the first keyword a first probability and the second keyword a second probability, wherein each probability is a corresponding likelihood that a corresponding keyword is to be included in a key phrase;
determining, in the retrospective layer, a first probability modifier that modifies the first probability based on a first dependence relationship between the second keyword being placed after the first keyword,
wherein the first probability modifier punishes the first keyword when the first keyword depends on the second keyword and the second keyword comes after the first keyword in the corpus, and
wherein punishes comprises reducing a probability that the first keyword is in the key phrase;
modifying the first probability using the first probability modifier to form a first modified probability;
using the first modified probability to determine whether the first keyword and the second keyword together form the key phrase; and
storing the key phrase in a non-transitory computer readable storage medium.