US 12,437,572 B2
Method and system for automated processing and continuous deployment of subpoena extraction model
William Watson, Long Beach, NY (US); Armineh Nourbakhsh, Pittsburgh, PA (US); and Sameena Shah, Scarsdale, NY (US)
Assigned to JPMORGAN CHASE BANK, N.A., New York, NY (US)
Filed by JPMorgan Chase Bank, N.A., New York, NY (US)
Filed on Nov. 21, 2022, as Appl. No. 17/991,321.
Prior Publication US 2024/0169753 A1, May 23, 2024
Int. Cl. G06V 30/416 (2022.01); G06F 16/93 (2019.01); G06F 40/284 (2020.01); G06Q 50/18 (2012.01)
CPC G06V 30/416 (2022.01) [G06F 16/93 (2019.01); G06F 40/284 (2020.01); G06Q 50/18 (2013.01)] 16 Claims
OG exemplary drawing
 
1. A method for preparing a response to a subpoena, the method being implemented by at least one processor, the method comprising:
receiving, by the at least one processor, a first subpoena;
extracting, by the at least one processor from the first subpoena, a first plurality of informational requests;
retrieving, by the at least one processor from a memory, a first data set;
analyzing, by the at least one processor, the first data set with respect to the first set of informational requests in order to identify items from within the first data set that are responsive to items included within the first set of informational requests;
generating, by the at least one processor, a report that includes a result of the analyzing; and
transmitting, by the at least one processor to a predetermined destination, the report,
wherein the analyzing comprises providing the first subpoena as an input to a Robustly optimized Bidirectional Encoder Representations from Transformers pre-training approach (ROBERTa) model that uses a sub-word approach to modeling words in a sequential format, and
wherein the ROBERTa model is generated by:
retrieving a plurality of second subpoenas for which responses have previously been generated;
extracting, from each respective second subpoena of the plurality of second subpoenas, a first set of tokens;
determining, for each respective token from within the first set of tokens, a corresponding set of spatial information that relates to a location of the respective token within the respective second subpoena, the corresponding set of spatial information including a pixel offset, a width, a height, a line number, a block association, and an extraction confidence; and
assigning, for each respective token from within the first set of tokens, a corresponding tag that indicates a type of information from among a predetermined set of information types.