| CPC G06F 16/2228 (2019.01) [G06F 16/2365 (2019.01)] | 10 Claims |

|
1. A system for disambiguating attributes associated to one or more entities, the system comprising: an entity database; an entity disambiguation computer comprising a memory, a processor, and a plurality of programming instructions, the plurality of programming instructions when executed by the processor cause the processor to: receive by, and store in, the entity database, information associated with a candidate entity among one or more entities in the entity database at pre-defined intervals; extract and store in an attributes database, one or more attributes associated with the candidate entity;
wherein the one or more attributes comprises a location, a geocode, an entity name, a stock symbol, a registered entity identity, an entity classification code, an entity uniform resource links (URLs), an employee data, an entity event, a technology domain, an entity group connection, an entity brand, and a competitor;
generate a linkage data instance for each pair of entities, the linkage data instance comprising a set of attribute value pairs, each attribute value pair including values for a specific attribute from each entity of the pair; create and store, in the entity database, for each attribute among the one or more attributes, a plurality of timeslice objects, wherein the plurality of timeslice objects are valid for respective durations, and wherein the entity disambiguation computer generates the plurality of timeslice objects based on the one or more attributes extracted from the one or more entities; associate the plurality of timeslice objects with one or more indices based on the respective durations of the plurality of timeslice objects, wherein the plurality of timeslice objects and the respective one or more indices are arranged based on timelines associated with the plurality of timeslice objects; calculate a similarity metric for each attribute value pair in the linkage data instance by applying a comparison algorithm that determines a similarity score based on a data type of the attribute; determine a position of a new timeslice object of the plurality of timeslice objects with respect to positions of other timeslice objects of the plurality of timeslice objects, wherein the new timeslice object is a latest created timeslice object; update the arrangement of the plurality of timeslice objects based on the position of the new timeslice object; determine if the two entities are associated with a same entity based on the calculated similarity metrics in the linkage data instance exceeding a predetermined threshold; responsive to determining that the two entities are associated with the same entity, merge the timeslice objects for the two entities into a single resolved entity object; and store the resolved entity object in the entity database, wherein the entity database provides a single, unified view of entities without ambiguity.
|