US 12,229,190 B2
Machine for analysis of entity resolution data graphs using peer data structures
W. Dwayne Collins, Conway, AR (US)
Assigned to LiveRamp, Inc., San Francisco, CA (US)
Appl. No. 18/020,109
Filed by LiveRamp, Inc., San Francisco, CA (US)
PCT Filed Apr. 29, 2021, PCT No. PCT/US2021/029960
§ 371(c)(1), (2) Date Feb. 7, 2023,
PCT Pub. No. WO2022/035477, PCT Pub. Date Feb. 17, 2022.
Claims priority of provisional application 63/109,183, filed on Nov. 3, 2020.
Claims priority of provisional application 63/070,911, filed on Aug. 27, 2020.
Claims priority of provisional application 63/063,791, filed on Aug. 10, 2020.
Prior Publication US 2023/0259557 A1, Aug. 17, 2023
Int. Cl. G06F 16/901 (2019.01)
CPC G06F 16/9024 (2019.01) 19 Claims
OG exemplary drawing
 
1. A machine for analyzing a subject entity resolution data graph against peer data structures, the subject entity resolution data graph comprising a plurality of asserted relationships, each asserted relationship comprising a set of entity data for a particular entity and connections between the entity data, the machine comprising:
at least one processor; and
at least one memory that stores executable instructions that, when executed by the at least one processor, facilitates performance of operations, the operations comprising:
screening a plurality of peer entity resolution data structures to produce a set of trustworthiness metrics, wherein the set of trustworthiness metrics are applied to produce a plurality of candidate data structures, wherein the plurality of candidate data structures form a subset of the plurality of peer entity resolution data structures;
creating combinations of each one of the plurality of candidate data structures with each other one of the plurality of candidate data structures and submitting each combination to an evolutionary analysis process to produce a plurality of oracles, wherein the plurality of oracles form a subset of the plurality of candidate data structures; and
deriving asserted relationships from each of the plurality of candidate data structures, matching those candidate data structure asserted relationships against the subject entity resolution data graph to produce a set of matching results by partitioning the subject entity resolution data graph into disjoint significantly smaller subsets, wherein each subset is self-contained, and wherein the partitioning starts with aggregation of entities sharing a best postal address, then expands to fields providing a general locality index, and wherein when the partitioning converges a feedback loop is performed in terms of a set of defining similarity indices to determine whether a small number of adjustments to partition elements is needed, and then performing data-level and entity-level evaluation against the set of matching results to produce a set of quality metrics for the subject entity resolution data graph.