US 12,147,421 B1
Method and system for identifying entities
David Cooke, Los Altos, CA (US); Martin Betz, Palo Alto, CA (US); Ashutosh Joshi, Fremont, CA (US); and Binay Mohanty, New Delhi (IN)
Assigned to Aurea Software, Inc., Austin, TX (US)
Filed by AUREA SOFTWARE, INC., Austin, TX (US)
Filed on Aug. 18, 2022, as Appl. No. 17/890,738.
Application 17/890,738 is a continuation of application No. 16/814,742, filed on Mar. 10, 2020, granted, now 11,455,301.
Application 16/814,742 is a continuation of application No. 15/971,492, filed on May 4, 2018, granted, now 10,606,831, issued on Mar. 31, 2020.
Application 15/971,492 is a continuation of application No. 14/311,179, filed on Jun. 20, 2014, granted, now 9,965,508, issued on May 8, 2018.
Application 14/311,179 is a continuation of application No. 13/274,287, filed on Oct. 14, 2011, granted, now 8,782,042, issued on Jul. 15, 2014.
Int. Cl. G06F 7/02 (2006.01); G06F 16/00 (2019.01); G06F 16/245 (2019.01); G06F 16/26 (2019.01); G06F 16/35 (2019.01); G06F 16/93 (2019.01); G06F 16/9535 (2019.01); G06F 40/295 (2020.01)
CPC G06F 16/245 (2019.01) [G06F 16/26 (2019.01); G06F 16/35 (2019.01); G06F 16/353 (2019.01); G06F 16/93 (2019.01); G06F 16/9535 (2019.01); G06F 40/295 (2020.01)]
12 Claims
1. A system for identifying an entity having specified entity attribute in a document, the system comprising:
one or more processors; and
a memory, coupled to the one or more processors, storing code that when executed by the one or more processors causes the one or more processors to perform operations comprising:
receiving, from each process of a plurality of processes, a corresponding set of candidate identity attributes that are each for identifying a particular entity having the specified entity attribute in the document, wherein each process of the plurality of processes generates the corresponding set of candidate identity attributes based on the specified entity attribute in the document;
calculating a score for each candidate identity attribute in the set of candidate identity attributes, wherein calculating the score for a particular candidate identity attribute comprises (1) identifying a set of tokens in the particular candidate identity attribute, (2) assigning a value to each token in the sets of tokens based on a token count that represents a number of instances of the token across the set of candidate identity attributes and (3) calculating the score based on the assigned values; and
identifying, based on the scores calculated for the candidate identity attributes, an identity attribute from the set of candidate identity attributes that identifies the entity having the specified entity attribute in the document.
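The scoring operations recited in claim 1 can be illustrated with a minimal sketch. The claim does not specify how candidates are tokenized, how a token count maps to a token value, or how token values combine into a score, so the choices below are assumptions made only for illustration: lowercase whitespace tokenization, a token value equal to its raw count, and a score equal to the mean token value. The function names `tokenize`, `score_candidates`, and `identify` are hypothetical.

```python
from collections import Counter
from typing import Dict, Iterable, List, Optional


def tokenize(candidate: str) -> List[str]:
    # Assumption: lowercase whitespace tokenization; the claim does not
    # say how tokens are identified within a candidate.
    return candidate.lower().split()


def score_candidates(candidate_sets: Iterable[Iterable[str]]) -> Dict[str, float]:
    # Pool the candidate identity attributes received from every process.
    candidates = [c for candidate_set in candidate_sets for c in candidate_set]

    # Token count: number of instances of each token across all candidates.
    token_counts = Counter(tok for c in candidates for tok in tokenize(c))

    scores: Dict[str, float] = {}
    for candidate in candidates:
        tokens = tokenize(candidate)
        if not tokens:
            continue
        # Assumption: token value = raw count, score = mean token value.
        scores[candidate] = sum(token_counts[t] for t in tokens) / len(tokens)
    return scores


def identify(candidate_sets: Iterable[Iterable[str]]) -> Optional[str]:
    # Select the candidate with the highest score, if any candidate exists.
    scores = score_candidates(candidate_sets)
    return max(scores, key=scores.get) if scores else None


if __name__ == "__main__":
    # Hypothetical outputs of three separate candidate-generating processes.
    sets = [["Acme Corp", "Acme"], ["Acme Corporation"], ["Apex Corp"]]
    print(score_candidates(sets))
    print(identify(sets))
```

Averaging token values favors short candidates built from frequently repeated tokens, while summing them would favor longer candidates. Either choice fits the claim language, which requires only that the score be calculated from the assigned token values.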