US 11,669,301 B2
Effectively fusing database tables
Stephen Meyles, Seattle, WA (US); Yan Yan, Seattle, WA (US); Carlos Sakoda, Seattle, WA (US); Ian Wesley-Smith, Seattle, WA (US); and Dan Suciu, Seattle, WA (US)
Assigned to AMPERITY, INC., Seattle, WA (US)
Filed by AMPERITY, INC., Seattle, WA (US)
Filed on Nov. 25, 2020, as Appl. No. 17/104,868.
Application 17/104,868 is a continuation of application No. 15/729,931, filed on Oct. 11, 2017, granted, now 10,853,033.
Prior Publication US 2021/0081171 A1, Mar. 18, 2021
Int. Cl. G06F 7/02 (2006.01); G06F 16/00 (2019.01); G06F 7/14 (2006.01); G06F 16/2455 (2019.01); G06F 16/215 (2019.01); G06F 16/23 (2019.01); G06F 16/242 (2019.01)
CPC G06F 7/14 (2013.01) [G06F 16/215 (2019.01); G06F 16/2365 (2019.01); G06F 16/244 (2019.01); G06F 16/24556 (2019.01)] 20 Claims
OG exemplary drawing
 
1. A method comprising:
de-duplicating a first database table and a second database table to generate a first de-duplicated database table and a second de-duplicated database table, the de-duplicating for a respective database table comprising:
performing pairwise comparisons on the respective database table to determine related pairs of records stored in the respective database table having a degree of similarity that exceeds a preset threshold,
identifying clusters based on the pairwise comparisons, and
consolidating redundant records in the respective database table using the clusters to generate a respective deduplicated database table;
performing third pairwise comparisons by comparing the first de-duplicated database table and second de-duplicated database table; and
generating a merged database table based on the third pairwise comparisons.