| CPC G06F 16/24558 (2019.01) [G06F 16/137 (2019.01); G06F 16/152 (2019.01); G06F 16/219 (2019.01); G06F 16/2255 (2019.01); H04L 9/0643 (2013.01); H04L 9/3247 (2013.01)] | 17 Claims |

|
1. One or more computer-readable media storing instructions which, when executed by one or more hardware processors, cause the one or more hardware processors to perform operations comprising:
receiving a first data block comprising a plurality of records extracted from one or more database tables, each record having a plurality of significant fields and, complementary to the significant fields, one or more deprecated fields;
serializing the plurality of significant fields of the plurality of records to obtain a sequence of tokens in text format, each token being an atomic component of text having a semantic meaning, wherein the deprecated fields are common to each of the records and the serializing disregards the deprecated fields;
applying one or more hash functions to the sequence of tokens to compute a first signature over the plurality of records in the first data block, wherein the computation of the first signature disregards the deprecated fields;
receiving a request to compare the first data block with a second data block;
comparing the first signature with a second signature of the second data block;
responsive to a first determination that the first signature matches the second signature, returning a match indication in response to the request; and
responsive to a second determination that the first signature does not match the second signature:
tracing the second determination to specific mismatched records of the first data block; and
returning, in response to the request, a mismatch indication comprising identifiers of all records of the first data block not matching corresponding records of the second data block.
|
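The operations recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. The field name `legacy_code`, the use of SHA-256, the JSON-based tokenization, and the two-level scheme (a digest per record, then a signature over the digests) are all assumptions chosen for illustration; the claim requires only "one or more hash functions" and does not specify how a mismatch is traced to individual records.

```python
import hashlib
import json

# Hypothetical deprecated field name; the claim does not name any fields.
DEPRECATED_FIELDS = frozenset({"legacy_code"})


def serialize_record(record, deprecated=DEPRECATED_FIELDS):
    """Serialize the significant fields of one record into text tokens.

    Deprecated fields are skipped, so they cannot influence the signature.
    """
    tokens = []
    for name in sorted(record):  # fixed field order for determinism
        if name in deprecated:
            continue
        tokens.append(name)
        tokens.append(json.dumps(record[name], sort_keys=True))
    return tokens


def record_digest(record):
    """Hash the token sequence of a single record (assumed: SHA-256)."""
    h = hashlib.sha256()
    for token in serialize_record(record):
        data = token.encode("utf-8")
        # Length prefix keeps token boundaries unambiguous.
        h.update(len(data).to_bytes(8, "big"))
        h.update(data)
    return h.hexdigest()


def block_signature(block):
    """Compute a signature over all records of a data block.

    `block` maps a record identifier to a record (a dict of fields).
    Returns the block signature and the per-record digests, which are
    kept so that a later mismatch can be traced to specific records.
    """
    digests = {rid: record_digest(rec) for rid, rec in block.items()}
    h = hashlib.sha256()
    for rid in sorted(digests):
        h.update(f"{rid}:{digests[rid]}\n".encode("utf-8"))
    return h.hexdigest(), digests


def compare_blocks(first, second):
    """Compare two data blocks by signature and report mismatched records."""
    sig1, digests1 = block_signature(first)
    sig2, digests2 = block_signature(second)
    if sig1 == sig2:
        return {"match": True, "mismatched_ids": []}
    # Trace the mismatch: records of the first block whose digest differs
    # from (or is absent in) the second block.
    mismatched = sorted(
        rid for rid in digests1 if digests1[rid] != digests2.get(rid)
    )
    return {"match": False, "mismatched_ids": mismatched}


first = {
    1: {"name": "a", "qty": 2, "legacy_code": "x"},
    2: {"name": "b", "qty": 3, "legacy_code": "y"},
}
# Differs from `first` only in a deprecated field.
second = {
    1: {"name": "a", "qty": 2, "legacy_code": "changed"},
    2: {"name": "b", "qty": 3, "legacy_code": "y"},
}
# Differs from `first` in a significant field of record 2.
third = {
    1: {"name": "a", "qty": 2, "legacy_code": "x"},
    2: {"name": "b", "qty": 4, "legacy_code": "y"},
}

print(compare_blocks(first, second))  # {'match': True, 'mismatched_ids': []}
print(compare_blocks(first, third))   # {'match': False, 'mismatched_ids': [2]}
```

In this sketch, keeping the per-record digests alongside the block signature is what makes the tracing step cheap: a match is decided by a single comparison of signatures, and the per-record comparison runs only when the signatures differ.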