US 12,254,028 B2
Systems and methods for document partitioning and partition labeling
Saurabh Tiwari, Bangalore (IN); Tuhin Bhura, Kolkata (IN); Shashanka Narayan, Chicago, IL (US); Vikas Gautam, Bengaluru (IN); Ashwarya Gupta, Bangalore (IN); and Ponnappa Ponjanda Appaiah, Mysore (IN)
Assigned to JPMORGAN CHASE BANK, N.A., New York, NY (US)
Filed by JPMORGAN CHASE BANK, N.A., New York, NY (US)
Filed on Aug. 29, 2023, as Appl. No. 18/458,066.
Claims priority of application No. 202311047328 (IN), filed on Jul. 13, 2023.
Prior Publication US 2025/0021580 A1, Jan. 16, 2025
Int. Cl. G06F 16/00 (2019.01); G06F 16/22 (2019.01); G06F 16/27 (2019.01)
CPC G06F 16/278 (2019.01) [G06F 16/2264 (2019.01)] 7 Claims
OG exemplary drawing
 
1. A method performed by a partitioning and labeling platform, comprising:
receiving a document file at a data capture service;
passing at least a portion of the document file as a byte stream to a unit extraction service, the unit extraction service performing steps of:
standardizing the byte stream into a string file;
determining a first logical partition separation indicator in the string file, wherein the first logical partition separation indicator is for a first partition level in a partition hierarchy, the first logical partition separation indicator comprising a first text separator;
setting a first variable value to a value of the first logical partition separation indicator;
determining a second logical partition separation indicator in the string file, wherein the second logical partition separation indicator is for a second partition level in the partition hierarchy, the second logical partition separation indicator comprising a second text separator;
setting a second variable value to a value of the second logical partition separation indicator;
determining a third logical partition separation indicator in the string file, wherein the third logical partition separation indicator is for the first partition level in the partition hierarchy, the third logical partition separation indicator comprising a third text separator;
writing the first variable value to a data structure, wherein the data structure has a standardized format;
writing the second variable value to the data structure;
persisting the data structure to a search index; and
clearing the first variable value and the second variable value.