US 11,727,702 B1
Automated indexing and extraction of information in digital documents
Julia Penfield, Seattle, WA (US); Aatish Suman, Austin, TX (US); Veeru Talreja, Morgantown, WV (US); and Misbah Zahid Khan, Mississauga (CA)
Assigned to VelocityEHS Holdings, Inc., Chicago, IL (US)
Filed by Velocity EHS Inc., Chicago, IL (US)
Filed on Jan. 17, 2023, as Appl. No. 18/098,055.
Int. Cl. G06V 10/82 (2022.01); G06V 30/24 (2022.01); G06F 40/205 (2020.01); G06F 40/284 (2020.01); G06F 40/258 (2020.01); G06F 40/295 (2020.01); G06V 30/19 (2022.01); G06V 30/413 (2022.01)
CPC G06V 30/2528 (2022.01) [G06F 40/205 (2020.01); G06F 40/258 (2020.01); G06F 40/284 (2020.01); G06F 40/295 (2020.01); G06V 10/82 (2022.01); G06V 30/19147 (2022.01); G06V 30/413 (2022.01)]
20 Claims
1. A computer-implemented method to automatically index targeted information in a digital document, the method comprising:
selecting a page number of a digital document to identify a page containing targeted information;
inputting an image of the page into a visual machine learning network (visual ML), wherein the visual ML is trained to recognize text associated with the targeted information in the image;
identifying, by the visual ML, a section of the image that contains the targeted information;
inputting the page number, the digital document, and coordinates of the section into an extraction module; and
extracting the targeted information by the extraction module from the section.
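
The five steps of claim 1 can be read as a simple data flow: page number, then image, then section coordinates, then extracted text. The sketch below is only an illustration of that flow and is not the patented implementation. Every name in it (Word, Page, select_page, stub_visual_ml, extract, index_targeted_information) is hypothetical. It assumes a page can be modeled as a list of words with bounding boxes, and it swaps the trained visual ML for a trivial keyword-based stand-in that returns the box of the text line holding the keyword.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

Box = Tuple[float, float, float, float]  # (x0, y0, x1, y1); assumed page units
PAGE_WIDTH = 612.0                       # assumed page width, illustration only


@dataclass
class Word:
    text: str
    box: Box


@dataclass
class Page:
    number: int
    words: List[Word]  # stand-in for both the page image and its text layer


def select_page(document: List[Page], keyword: str) -> int:
    """Step 1: choose the page number of a page mentioning the keyword."""
    for page in document:
        if any(keyword.lower() in w.text.lower() for w in page.words):
            return page.number
    raise ValueError(f"no page mentions {keyword!r}")


def stub_visual_ml(page: Page, keyword: str) -> Box:
    """Steps 2-3 stand-in (NOT a trained network): return the coordinates of
    the full-width strip covering the line on which the keyword appears."""
    for w in page.words:
        if keyword.lower() in w.text.lower():
            _, y0, _, y1 = w.box
            return (0.0, y0, PAGE_WIDTH, y1)
    raise ValueError("keyword not found on page")


def extract(page_number: int, document: List[Page], box: Box) -> str:
    """Steps 4-5: given page number, document, and section coordinates,
    return the words whose centers fall inside the section."""
    page = next(p for p in document if p.number == page_number)
    x0, y0, x1, y1 = box
    inside = []
    for w in page.words:
        cx = (w.box[0] + w.box[2]) / 2
        cy = (w.box[1] + w.box[3]) / 2
        if x0 <= cx <= x1 and y0 <= cy <= y1:
            inside.append(w)
    inside.sort(key=lambda w: (w.box[1], w.box[0]))  # reading order
    return " ".join(w.text for w in inside)


def index_targeted_information(
    document: List[Page],
    keyword: str,
    detector: Callable[[Page, str], Box] = stub_visual_ml,
) -> Tuple[int, Box, str]:
    """Run the claimed sequence end to end with a pluggable detector."""
    page_number = select_page(document, keyword)
    page = next(p for p in document if p.number == page_number)
    box = detector(page, keyword)
    return page_number, box, extract(page_number, document, box)


# Made-up two-page document used only to exercise the sketch.
doc = [
    Page(1, [Word("Safety", (10, 20, 60, 30)), Word("Data", (65, 20, 95, 30)),
             Word("Sheet", (100, 20, 140, 30))]),
    Page(2, [Word("CAS", (10, 100, 40, 110)), Word("No.", (45, 100, 65, 110)),
             Word("67-64-1", (70, 100, 120, 110)),
             Word("Acetone", (10, 120, 60, 130))]),
]

if __name__ == "__main__":
    print(index_targeted_information(doc, "CAS"))
```

Passing the detector as a parameter keeps the section-finding step separate from the extraction step, mirroring the claim's split between the visual ML and the extraction module. A real detector would operate on a rendered page image rather than on word boxes.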