CPC G06F 16/285 (2019.01) [G06F 16/2272 (2019.01); G06F 16/2379 (2019.01)] | 20 Claims |
1. A method, comprising:
configuring a data processing job for a tenant of a data processing platform, wherein configuring the data processing job comprises receiving a selection of a scheduled trigger condition for the data processing job, a set of matching criteria to use for identifying duplicate data records, and a plurality of data records to check for the duplicate data records; and
initiating the data processing job for the tenant of the data processing platform based at least in part on the scheduled trigger condition being satisfied, wherein the data processing job comprises:
identifying the duplicate data records in the plurality of data records based at least in part on the set of matching criteria configured for the tenant of the data processing platform, the duplicate data records comprising a first set of fields with matching data and a second set of fields with conflicting data;
receiving a selection of a set of merging criteria to use for combining the duplicate data records into a set of merged data records; and
merging the duplicate data records into the set of merged data records based at least in part on the set of merging criteria configured for the tenant of the data processing platform, wherein merging the duplicate data records comprises selecting, from the conflicting data of the duplicate data records, data to include in one or more fields of the set of merged data records.
|
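The flow recited in claim 1 (a scheduled trigger, duplicate identification by matching criteria, then merging by merging criteria) can be illustrated with a minimal sketch. This is not the claimed implementation. The function names (`find_duplicates`, `merge_group`, `run_job`), the dictionary record format, and the "most recently updated record wins" merge rule are all assumptions chosen for illustration. The sketch also assumes every record carries the recency field.

```python
from collections import defaultdict
from datetime import datetime


def find_duplicates(records, match_fields):
    """Group records whose match_fields agree (case- and whitespace-insensitive).

    Only groups with two or more records are returned.
    """
    groups = defaultdict(list)
    for rec in records:
        key = tuple(str(rec.get(f, "")).strip().lower() for f in match_fields)
        groups[key].append(rec)
    return [g for g in groups.values() if len(g) > 1]


def merge_group(group, recency_field):
    """Merge one duplicate group into a single record.

    Hypothetical merge rule: for conflicting fields, the value from the
    record with the greatest recency_field wins; older records only fill
    fields that are missing or empty in newer ones.
    """
    ordered = sorted(group, key=lambda r: r[recency_field], reverse=True)
    merged = {}
    for rec in ordered:
        for field, value in rec.items():
            if field not in merged or merged[field] in (None, ""):
                merged[field] = value
    return merged


def run_job(records, match_fields, recency_field, scheduled_at, now):
    """Run the dedupe-and-merge job only once the scheduled time is reached."""
    if now < scheduled_at:
        return list(records)  # trigger condition not satisfied; nothing changes
    duplicates = find_duplicates(records, match_fields)
    duplicate_ids = {id(r) for g in duplicates for r in g}
    unique = [r for r in records if id(r) not in duplicate_ids]
    return unique + [merge_group(g, recency_field) for g in duplicates]
```

In this sketch, `match_fields` plays the role of the set of matching criteria and `recency_field` stands in for the set of merging criteria. A per-tenant configuration could supply different values for each, but how the claimed platform stores or applies such configuration is not stated in the claim.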