US 12,217,497 B2
	Extracting key information from document using trained machine-learning models
Yakupitiyage Don Thanuja Samodhye Dharmasiri, Melbourne (AU); Xu Zhong, Melbourne (AU); Ahmed Ataallah Ataallah Abobakr, Geelong (AU); Hongtao Yang, Sydney (AU); Budhaditya Saha, Sydney (AU); Shaoke Xu, Sunnyvale, CA (US); Shashi Prasad Suravarapu, San Ramon, CA (US); Mark Edward Johnson, Castle Cove (AU); and Thanh Long Duong, Seabrook (AU)
Assigned to ORACLE INTERNATIONAL CORPORATION, Redwood Shores, CA (US)
Filed by Oracle International Corporation, Redwood Shores, CA (US)
Filed on Aug. 15, 2022, as Appl. No. 17/888,300.
Claims priority of provisional application 63/249,255, filed on Sep. 28, 2021.
Prior Publication US 2023/0095673 A1, Mar. 30, 2023
Int. Cl. G06V 10/82 (2022.01); G06V 30/148 (2022.01); G06V 30/412 (2022.01)

CPC G06V 10/82 (2022.01) [G06V 30/153 (2022.01); G06V 30/412 (2022.01)]

17 Claims

1. A method comprising:

receiving, at a data processing system, a set of data within a document, the set of data including one or more key fields, wherein the data processing system comprises a table detection module, a key information extraction module, and a table extraction module;

extracting, by the data processing system, text information and location data associated with the text information by performing an optical character recognition operation, the text information and the location data from the document;

detecting, by the table detection module, whether one or more tables are present in the document, wherein the table detection module is a trained object detection neural network trained using document images for outputting a plurality of pixel coordinates of one or more tables in the document images;

for each of the one or more tables determined to be within the document, detecting, by the table detection module, a location of each of the one or more tables included in the document;

extracting, by the key information extraction module and based on the text information and the location data, text from the one or more key fields;

extracting, by the table extraction module, each of the one or more tables determined to be within the document based on: (i) text information and location data associated with the text information, and (ii) the location of each of the one or more tables; and

outputting, by the data processing system, extraction results for the document, the extraction results including the text from the one or more key fields, and the one or more tables.