US 12,217,274 B2
System and method for implementing automated harvesting of scanned contracts
Ilya Slavin, Allen, TX (US); Wilson De Sousa Ferreira, Jr., New York, NY (US); and Fernando Nitz, Buenos Aires (AR)
Assigned to JPMorgan Chase Bank, N.A., New York, NY (US)
Filed by JPMorgan Chase Bank, N.A., New York, NY (US)
Filed on Aug. 27, 2021, as Appl. No. 17/446,201.
Prior Publication US 2023/0064537 A1, Mar. 2, 2023
Int. Cl. G06Q 30/02 (2023.01); G06F 16/951 (2019.01); G06F 16/957 (2019.01); G06F 40/20 (2020.01); G06Q 30/0201 (2023.01)
CPC G06Q 30/0201 (2013.01) [G06F 16/951 (2019.01); G06F 16/9574 (2019.01); G06F 40/20 (2020.01)] 20 Claims
OG exemplary drawing
 
1. A system that implements an automated harvesting engine, the system comprising:
a memory component that stores project data;
an interface that interacts with a project management system that comprises a compilation of scans of documents pertaining to one or more projects; and
an automated harvesting engine that comprises a computer processor and is coupled to the memory component and the interface, wherein the computer processor is further configured to perform operations comprising:
logging into, through a browser automation tool executing a programmable web browser, the project management system through a series of interfaces for a project of interest, wherein the browser automation tool emulates human user interactions with the programmable web browser by injecting irregular pauses between steps in order to perform operations including logging into the project management system, and wherein the project management system prevents pre-engineered anchoring;
parsing a codebase of the project management system to determine appropriate website anchors and link names for each login session, wherein the project management system continuously creates website anchors with different names on every login;
accessing cached metadata associated with the project of interest;
based on the cached metadata, executing one or more searches across multiple workspaces for the project of interest;
locating a workspace within the project management system for the project of interest, wherein the workspace comprises a folder structure with multiple layers of folders and subfolders;
using heuristics, accessing one or more scans for the project of interest within the project management system;
using the programmable web browser, initiating a download of the one or more scans from the project management system;
storing the one or more scans in a database; and
interacting with a market data analytics tool to perform analytics associated with the one or more scans.