US 12,293,598 B2
Entity extraction via document image processing
Swati Tata, Bangalore (IN); Anjani Kumari, Jharkhand (IN); Abhishek Singh, Jharkhand (IN); Kavita V V Ganeshan, Mumbai (IN); Omar Razi, Bengaluru (IN); Prakhar Gupta, Meerut (IN); Achal Gambhir, Bangalore (IN); and Ranjan Sarmah, Assam (IN)
Assigned to ACCENTURE GLOBAL SOLUTIONS LIMITED, Dublin (IE)
Filed by ACCENTURE GLOBAL SOLUTIONS LIMITED, Dublin (IE)
Filed on Nov. 16, 2022, as Appl. No. 17/987,969.
Prior Publication US 2024/0161528 A1, May 16, 2024
Int. Cl. G06K 9/00 (2022.01); G06V 30/186 (2022.01); G06V 30/19 (2022.01); G06V 30/412 (2022.01)

CPC G06V 30/412 (2022.01) [G06V 30/186 (2022.01); G06V 30/19153 (2022.01); G06V 30/19167 (2022.01)]

20 Claims

13. A method of data processing comprising:

accessing an image of a document including a plurality of data units;

obtaining a document image by converting the document into an image format;

implementing a connected components process that analyzes the document image as a series of sub-graphs;

determining that the plurality of data units includes at least one floating image based on the connected components process;

disregarding the at least one floating image from further processing;

identifying serially, one of a structured data unit and unstructured floating text from a first masked image and a second masked image generated from the first masked image;

identifying corresponding regions of the document image including one or more of the structured data unit and the unstructured floating text;

obtaining optical character recognition (OCR) input from the corresponding document image regions including the one or more of the structured data unit and the unstructured floating text,

wherein the OCR input includes textual data obtained based on a semantic context derived from logical boundaries defined by the corresponding document image regions; and

generating a machine-consumable data set including entities extracted from the OCR input.
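
The connected components and floating-image steps recited in claim 13 can be illustrated with a minimal sketch. It is not the patented implementation: the claim does not specify how a floating image is recognized, so the function names (`connected_components`, `is_floating_image`, `mask_floating_images`), the 4-connectivity rule, and the area and fill-density thresholds are all assumptions chosen for illustration. The binarized page is treated as a pixel adjacency graph, each connected component is one sub-graph, and large, densely filled components are assumed to be floating images and are blanked out to produce a masked image.

```python
from collections import deque


def connected_components(grid):
    """Return the 4-connected groups of foreground (1) pixels.

    Each component is a list of (row, col) coordinates, i.e. one
    sub-graph of the pixel adjacency graph.
    """
    rows, cols = len(grid), len(grid[0])
    seen = [[False] * cols for _ in range(rows)]
    components = []
    for r in range(rows):
        for c in range(cols):
            if grid[r][c] != 1 or seen[r][c]:
                continue
            # Breadth-first search from an unvisited foreground pixel.
            queue = deque([(r, c)])
            seen[r][c] = True
            pixels = []
            while queue:
                y, x = queue.popleft()
                pixels.append((y, x))
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and grid[ny][nx] == 1 and not seen[ny][nx]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            components.append(pixels)
    return components


def is_floating_image(pixels, min_area=9, min_fill=0.9):
    """Hypothetical heuristic (an assumption, not from the claim):
    a component is a floating image if its bounding box is large
    and almost completely filled."""
    ys = [y for y, _ in pixels]
    xs = [x for _, x in pixels]
    box_area = (max(ys) - min(ys) + 1) * (max(xs) - min(xs) + 1)
    return box_area >= min_area and len(pixels) / box_area >= min_fill


def mask_floating_images(grid):
    """Return a copy of the page with floating images blanked out,
    plus the number of components that were disregarded."""
    masked = [row[:] for row in grid]
    removed = 0
    for pixels in connected_components(grid):
        if is_floating_image(pixels):
            removed += 1
            for y, x in pixels:
                masked[y][x] = 0
    return masked, removed


# Toy binarized page: a solid 3x3 block (image-like) and a few
# small text-like marks.
page = [
    [1, 1, 1, 0, 0, 0, 0],
    [1, 1, 1, 0, 1, 0, 1],
    [1, 1, 1, 0, 0, 0, 0],
    [0, 0, 0, 0, 1, 1, 0],
]

masked, removed = mask_floating_images(page)
print(removed)  # number of components treated as floating images
for row in masked:
    print(row)
```

In this sketch the masked page, rather than the original, would be passed on to the later steps of the claim (region identification and OCR), so that image content is disregarded before text is recognized.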