US 12,292,866 B2
Data unification
Meiyalagan Balasubramanian, Redmond, WA (US); Lengning Liu, Redmond, WA (US); Aditya Kuppa, Redmond, WA (US); Kirk Hartmann Freiheit, Orlando, FL (US); Kalen Wong, Redmond, WA (US); Paula Budig Greve, Monroe, WA (US); Patrick Clinton Little, Seattle, WA (US); Lucas Pritz, Redmond, WA (US); Yue Wang, Bellevue, WA (US); Vivek Ravindranath Narasayya, Redmond, WA (US); Katchaguy Areekijseree, Woodinville, WA (US); Yeye He, Bellevue, WA (US); Surajit Chaudhuri, Kirkland, WA (US); and Gaurav Ghosh, Bellevue, WA (US)
Assigned to Microsoft Technology Licensing, LLC, Redmond, WA (US)
Filed by Microsoft Technology Licensing, LLC, Redmond, WA (US)
Filed on Jun. 7, 2023, as Appl. No. 18/331,169.
Application 18/331,169 is a continuation of application No. 17/490,908, filed on Sep. 30, 2021, granted, now 11,714,790.
Prior Publication US 2023/0315701 A1, Oct. 5, 2023
Int. Cl. G06F 16/215 (2019.01); G06F 16/2455 (2019.01)
CPC G06F 16/215 (2019.01) [G06F 16/24556 (2019.01)] 20 Claims
OG exemplary drawing
 
1. A method of data unification, the method comprising:
receiving a conflation plan;
building an index for each partition of a plurality of indexed partitions;
receiving a first plurality of data records;
generating an updated first plurality of data records by performing a data unification process based on the conflation plan for the first plurality of data records, wherein the data unification process comprises performing an in-memory clustering for each partition of the plurality of indexed partitions in parallel;
receiving a second plurality of data records;
generating a plurality of merged data records by merging the second plurality of data records and the updated first plurality of data records;
generating a stable identifier (stableID) for each data record of the plurality of merged data records; and
updating each index of each partition of the plurality of indexed partitions with the plurality of merged data records.