US 12,111,838 B2
Systems and methods for expedited large file processing
Japan Bhatt, Plano, TX (US)
Assigned to CAPITAL ONE SERVICES, LLC, McLean, VA (US)
Filed by Capital One Services, LLC, McLean, VA (US)
Filed on Apr. 10, 2023, as Appl. No. 18/297,957.
Application 18/297,957 is a continuation of application No. 17/201,311, filed on Mar. 15, 2021, granted, now 11,625,408.
Application 17/201,311 is a continuation of application No. 16/233,796, filed on Dec. 27, 2018, granted, now 10,949,433, issued on Mar. 16, 2021.
Application 16/233,796 is a continuation of application No. 15/905,163, filed on Feb. 26, 2018, granted, now 10,191,952, issued on Jan. 29, 2019.
Application 15/905,163 is a continuation of application No. 15/659,143, filed on Jul. 25, 2017, granted, now 9,934,287, issued on Apr. 3, 2018.
Prior Publication US 2023/0244680 A1, Aug. 3, 2023
This patent is subject to a terminal disclaimer.
Int. Cl. G06F 16/2458 (2019.01); G06F 16/18 (2019.01); G06F 16/188 (2019.01); G06F 16/215 (2019.01); G06F 16/28 (2019.01)
CPC G06F 16/2458 (2019.01) [G06F 16/188 (2019.01); G06F 16/215 (2019.01); G06F 16/285 (2019.01)] 20 Claims
OG exemplary drawing
 
1. A system for processing a large file, comprising:
one or more processors; and
a memory in communication with the one or more processors and storing instructions that, when executed by the one or more processors, are configured to cause the system to:
receive record data comprising a plurality of records, each of the plurality of records having a data format comprising a sequence of characters;
determine, based on a comparison of a size of the record data to a predetermined size threshold, an order of magnitude for a seed portion;
determine, based on the data format, a plurality of unique focus values, each of the plurality of unique focus values corresponding to a sub-group of the plurality of records, wherein each of the plurality of unique focus values correspond to a specified portion of the sequence of characters in the data format, a number of the plurality of unique focus values being based on the order of magnitude of the seed portion;
create a plurality of virtual processing units, each associated with a unique one of the plurality of unique focus values; and
process, by each of the plurality of virtual processing units, the corresponding sub-group of the plurality of records that corresponds to the focus value associated with the respective virtual processing unit.