US 12,130,777 B2
Systems and methods for performant data matching
Curtiss W. Schuler, Glendale, AZ (US); Brett A. Norris, North Vancouver (CA); and Satyender Goel, Chicago, IL (US)
Assigned to Collibra Belgium BV, Brussels (BE)
Filed by Collibra Belgium BV, Brussels (BE)
Filed on Jun. 14, 2023, as Appl. No. 18/334,965.
Application 18/334,965 is a continuation of application No. 17/369,798, filed on Jul. 7, 2021, granted, now 11,693,821.
Prior Publication US 2023/0325351 A1, Oct. 12, 2023
This patent is subject to a terminal disclaimer.
Int. Cl. G06F 16/14 (2019.01); G06F 16/13 (2019.01)
CPC G06F 16/152 (2019.01) [G06F 16/137 (2019.01)] 20 Claims
OG exemplary drawing
 
1. A method of data matching, the method comprising:
determining, by simple weighting, that at least one token record from a first source and at least one token record from a second source satisfy at least one token set rule, wherein the at least one token set rule is based on a presence of at least one common token;
generating a token set by merging the at least one token record from the first source with the at least one token record from the second source;
comparing the token set to at least one token record from a third source; and
retokenizing, by at least one hashing roll-up function, the at least one token record from the third source into the token set.