US 11,782,889 B2
Systems and methods for continuous data profiling
James B. Cushman, II, Longboat Key, FL (US); Vadim Vaks, Holland, PA (US); and Satyender Goel, Chicago, IL (US)
Assigned to Collibra Belgium BV, Brussels (BE)
Filed by Collibra Belgium BV, Brussels (BE)
Filed on Jun. 30, 2021, as Appl. No. 17/364,849.
Prior Publication US 2023/0004539 A1, Jan. 5, 2023
Int. Cl. G06F 16/215 (2019.01); G06F 16/22 (2019.01); G06F 16/2457 (2019.01); G06F 16/25 (2019.01)
CPC G06F 16/215 (2019.01) [G06F 16/2282 (2019.01); G06F 16/2457 (2019.01); G06F 16/252 (2019.01)] 20 Claims
OG exemplary drawing
 
1. A system for continuously profiling data, comprising:
a memory configured to store non-transitory computer readable instructions; and
a processor communicatively coupled to the memory, wherein the processor, when executing the non-transitory computer readable instructions, is configured to:
receive a first input stream of data;
profile the first input stream of data by,
identifying at least one attribute in the first input stream of data, wherein the at least one attribute is associated with a series of features, and
determining a profiling score for the at least one attribute based on an aggregation of the series of features;
generate a first profiled set of data based on the profiling of the first input stream of data, wherein the first profiled set of data includes the profiling score for the at least one attribute;
receive a second input steam of data;
compare the second input stream of data to at least one profile trigger;
initiate the at least one profile trigger;
profile the second input stream of data;
generate a second profiled set of data based on the profiling of the second input stream of data; and
store the first profiled set of data and the second profiled set of data in at least one client database.