| CPC G06Q 20/102 (2013.01) [G06Q 30/04 (2013.01)] | 20 Claims |

|
1. A computer system for improving error tolerance in processing a plurality of input files, the computer system comprising:
at least one database; and
at least one processor communicatively coupled to the at least one database, the at least one processor programmed to:
receive the input files and distribute portions of each received input file to one of a plurality of node-cluster nodes;
perform an inbound process in parallel at each node-cluster node by:
receiving respective portions of the routed input files;
extracting input data records from the input files, each of the input data records comprising data for a plurality of input fields; and
storing the input data records in the at least one database;
store a plurality of destination records in the at least one database, each of the destination records including data for a plurality of destination fields, wherein the plurality of destination fields correspond to the plurality of input fields;
for each stored input data record, execute a matching engine to:
compare the data in the plurality of input fields to the data in the destination fields of each of the destination records;
generate a match score for each of the destination records based on the comparison of the data;
generate a list of candidate destination records by ranking the match score for each destination record; and
update the input data record stored in the at least one database to include the list of candidate destination records and the corresponding match scores; and
for each updated stored input data record, perform a core process in parallel at each node-cluster node by:
comparing, for each of the candidate destination records, a format of the data in at least one of the input fields to a mask associated with the corresponding destination field;
adjusting the match score of the candidate destination record based on the comparison to the mask;
generating an output record including the data from the destination fields of the candidate destination record having the highest adjusted match score; and
transmitting the output record to a gateway processor.
|