US 11,983,514 B2
Systems and methods for data transformation
Komal Krishna Pandamaneti, Frisco, TX (US); Sriram Srinivasan, Frisco, TX (US); Lavanya Raja, Frisco, TX (US); Sairam Kommireddy, Frisco, TX (US); Maria Rabinowitz, Arlington, VA (US); Steven W. Braskamp, Dallas, TX (US); Steven G. Chiagouris, Plano, TX (US); Jason E. Lines, Prosper, TX (US); Arjun Dugal, Dallas, TX (US); and Brittany Nicole Geron Courtney, Celina, TX (US)
Assigned to Capital One Services, LLC, McLean, VA (US)
Filed by Capital One Services, LLC, McLean, VA (US)
Filed on Sep. 29, 2022, as Appl. No. 17/936,510.
Prior Publication US 2024/0111503 A1, Apr. 4, 2024
Int. Cl. G06F 9/44 (2018.01); G06F 8/35 (2018.01); G06F 8/71 (2018.01)
CPC G06F 8/35 (2013.01) [G06F 8/71 (2013.01)] 15 Claims
OG exemplary drawing
 
1. A system for transforming data from a batch-based legacy software program for processing by a streaming-based new software program, the system comprising:
one or more processors; and
a non-transitory, computer-readable medium comprising instructions that, when executed by the one or more processors, cause operations comprising:
receiving batch information for a plurality of threads to be processed by a new software program, the batch information including for each thread input data and output data captured from a legacy software program during a particular time period;
generating, from the batch information, an information stream based on a data model for the new software program, the information stream including at least some of the input data for the plurality of threads, the data model for the new software program being different from a data model for the legacy software program;
processing, using the new software program, the information stream for the plurality of threads to generate corresponding output data for the particular time period;
comparing the output data from the new software program and the output data from the legacy software program for the particular time period;
based on the comparison, determining that at least one thread has a mismatch between the output data from the new software program and the output data from the legacy software program;
adjusting the data model for the new software program based on the mismatch for the at least one thread;
generating, from the batch information, an updated information stream based on the adjusted data model for the new software program; and
processing, using the new software program, the updated information stream for the plurality of threads to generate corresponding output data for the particular time period.