US 11,675,754 B2
Systems and methods for universal reference source creation and accurate secure matching
Satyender Goel, Chicago, IL (US); and James B. Cushman, Longboat Key, FL (US)
Assigned to Collibra Belgium BV, Brussels (BE)
Filed by Collibra Belgium BV, Brussels (BE)
Filed on Nov. 24, 2020, as Appl. No. 17/103,751.
Prior Publication US 2022/0164324 A1, May 26, 2022
Int. Cl. G06F 16/215 (2019.01); G06F 16/2455 (2019.01); G06F 16/27 (2019.01); G06F 16/23 (2019.01); G06F 21/60 (2013.01); G06F 21/62 (2013.01)
CPC G06F 16/215 (2019.01) [G06F 16/2365 (2019.01); G06F 16/24558 (2019.01); G06F 16/27 (2019.01); G06F 21/602 (2013.01); G06F 21/6254 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A system for reference source matching without receiving identifiable personal information, comprising:
a memory configured to store non-transitory computer readable instructions; and
a processor communicatively coupled to the memory, wherein the processor, when executing the non-transitory computer readable instructions, is configured to:
receive at least one token record from a first source and at least one first attributes bit string associated with the at least one token record from the first source;
receive at least one token record from a second source and at least one second attributes bit string associated with the at least one token record from the second source;
wherein the first token record and the second token record each comprise masked personal data attributes;
compare the at least one token record from the first source to the at least one token record from the second source by comparing the at least one first attributes bit string to the at least one second attributes bit string;
generate at least one linked pair, wherein the linked pair indicates a degree of overlap of tokens associated with the at least one token record from the first source and the at least one token record from the second source;
based on the degree of overlap from the at least one linked pair, determine whether the at least one token record from the first source matches to the at least one token record from the second source;
based on the determination of a match of the at least one token record from the first source to the at least one token record from the second source, create a unique token record in a customer environment, wherein the unique token is a single token record representing the at least one token from the first source and the at least one token from the second source;
compare the unique token to at least one universal reference token repository; and
based on a determination that the unique token is not present in the at least one universal token repository, store the unique token in the universal reference token repository.