US 12,154,008 B2
Entity analysis system
Amit R. Kapur, Venice, CA (US); Steven F. Pearman, Redondo Beach, CA (US); and James R. Benedetto, Hermosa Beach, CA (US)
Assigned to Verizon Patent and Licensing Inc., Basking Ridge, NJ (US)
Filed by Verizon Patent and Licensing Inc., Basking Ridge, NJ (US)
Filed on Mar. 12, 2021, as Appl. No. 17/199,691.
Application 17/199,691 is a continuation of application No. 15/381,253, filed on Dec. 16, 2016, granted, now 10,984,339.
Application 15/381,253 is a continuation of application No. 14/926,059, filed on Oct. 29, 2015, granted, now 9,558,456, issued on Jan. 31, 2017.
Application 14/926,059 is a continuation of application No. 13/205,585, filed on Aug. 8, 2011, granted, now 9,202,176, issued on Dec. 1, 2015.
Prior Publication US 2021/0201203 A1, Jul. 1, 2021
This patent is subject to a terminal disclaimer.
Int. Cl. G06N 20/00 (2019.01); G06F 40/279 (2020.01); G06N 5/02 (2023.01); G06N 7/01 (2023.01)
CPC G06N 20/00 (2019.01) [G06F 40/279 (2020.01); G06N 5/02 (2013.01); G06N 7/01 (2023.01)] 20 Claims
OG exemplary drawing
 
1. A computer-implemented method of learning related entities, the method comprising:
receiving a plurality of entities, each entity among the plurality of entities relating to a first concept;
receiving training content including the plurality of entities; and
learning additional entities that are related to the first concept by iteratively performing the following steps:
identifying one or more potential word templates from the training content based on occurrences of one or more words in the training content relating to the first concept;
identifying one or more word templates from the one or more potential word templates based on a frequency of occurrence of the one or more potential word templates and based on one or more part-of-speech tags of the one or more potential word templates;
adding the one or more identified word templates to a set of word templates;
generating, for each identified word template, a confidence score for the identified word template based on a frequency of occurrence of the identified word template; and
adjusting, for each identified word template, the confidence score of the identified word template based on whether the one or more part of speech tags of the identified word template is similar to part-of-speech tags of word templates of the set of word templates.