US 12,235,813 B2
Systems and methods for continuous data profiling
James B. Cushman, II, Longboat Key, FL (US); Vadim Vaks, Holland, PA (US); and Satyender Goel, Chicago, IL (US)
Assigned to Collibra Belgium BV, Brussels (BE)
Filed by Collibra Belgium BV, Brussels (BE)
Filed on Sep. 18, 2023, as Appl. No. 18/469,363.
Application 18/469,363 is a continuation of application No. 17/364,849, filed on Jun. 30, 2021, granted, now 11,782,889.
Prior Publication US 2024/0004849 A1, Jan. 4, 2024
This patent is subject to a terminal disclaimer.
Int. Cl. G06F 16/215 (2019.01); G06F 16/22 (2019.01); G06F 16/2457 (2019.01); G06F 16/25 (2019.01)
CPC G06F 16/215 (2019.01) [G06F 16/2282 (2019.01); G06F 16/2457 (2019.01); G06F 16/252 (2019.01)] 17 Claims
OG exemplary drawing
 
1. A method of continuously profiling data, the method comprising:
receiving at least one input stream of data;
profiling the at least one input stream of data by,
identifying at least one attribute in the at least one input stream of data, wherein the at least one attribute is associated with a series of features,
determining a profiling score for the at least one attribute based on a source of the at least one input stream of data and an aggregation of the series of features,
determining the at least one attribute is indicative of an address, and
in response to determining the at least one attribute is indicative of the address, processing the at least one attribute through an address library engine that adds the at least one attribute to a library of addresses;
generating a profiled set of data based on the profiling of the at least one input stream of data, wherein the profiled set of data includes the profiling score for the at least one attribute; and
storing the profiled set of data in at least one client database.