US 11,868,717 B2
Multi-page document recognition in document capture
Ming Fung Ho, Fremont, CA (US)
Assigned to Open Text Corporation, Waterloo (CA)
Filed by Open Text Corporation, Waterloo (CA)
Filed on Sep. 8, 2022, as Appl. No. 17/940,777.
Application 17/940,777 is a continuation of application No. 16/953,561, filed on Nov. 20, 2020, abandoned.
Application 16/953,561 is a continuation of application No. 16/290,453, filed on Mar. 1, 2019, granted, now 10,860,848, issued on Dec. 8, 2020.
Application 16/290,453 is a continuation of application No. 15/221,433, filed on Jul. 27, 2016, granted, now 10,248,858, issued on Apr. 2, 2019.
Application 15/221,433 is a continuation of application No. 13/720,671, filed on Dec. 19, 2012, granted, now 9,430,453, issued on Aug. 30, 2016.
Prior Publication US 2023/0005285 A1, Jan. 5, 2023
Int. Cl. G06F 40/226 (2020.01); G06F 40/174 (2020.01); G06V 30/413 (2022.01); G06V 30/416 (2022.01); G06V 30/10 (2022.01)
CPC G06F 40/226 (2020.01) [G06F 40/174 (2020.01); G06V 30/413 (2022.01); G06V 30/416 (2022.01); G06V 30/10 (2022.01)]
21 Claims
 
1. A method of capturing document data, comprising:
obtaining a multi-page document;
determining a document type based at least in part on the multi-page document;
creating an instance of a selected one of a plurality of type-specific data entry forms in a forms library based at least in part on the document type, wherein the instance of the data entry form is associated with the multi-page document;
populating the instance of the data entry form associated with the multi-page document based at least in part on the data associated with the multi-page document;
identifying, according to one or more validation rules, one or more fields of the instance of the data entry form for which validation of the corresponding data is required based at least in part on data extracted from a plurality of pages associated with the multi-page document, wherein data extracted from a first page of the plurality of pages is dependent on data extracted from a second page of the plurality of pages;
performing automatic validation on the data extracted from the plurality of pages and associated with the one or more fields of the instance of the data entry form to generate a degree of confidence for values of the extracted data associated with the one or more fields;
identifying a subset of the values of the extracted data where a degree of confidence generated by the automatic validation is below a threshold, where each of the subset of values corresponds to a field of the one or more fields; and
providing, to the user, the identified subset of values and their corresponding fields of the instance of the data entry form for which validation of the corresponding values is required.
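
The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the forms library contents, the keyword-based type detection, the "name: value" extraction, the cross-page rule (total on one page must equal subtotal plus tax from another), the confidence values 1.0 and 0.3, and the 0.8 threshold are all assumptions chosen only to make the flow concrete.

```python
from dataclasses import dataclass

# Hypothetical forms library: document type -> field names of its entry form.
FORMS_LIBRARY = {
    "invoice": ["subtotal", "tax", "total"],
    "letter": ["sender", "recipient"],
}


@dataclass
class FormInstance:
    """An instance of a type-specific data entry form tied to one document."""
    doc_type: str
    fields: dict  # field name -> extracted value (string)


def determine_type(pages):
    # Toy classifier (assumption): look for a keyword anywhere in the document.
    text = " ".join(pages).lower()
    return "invoice" if "invoice" in text else "letter"


def extract(pages, names):
    # Toy extraction (assumption): "name: value" lines found on any page.
    values = {}
    for page in pages:
        for line in page.splitlines():
            key, sep, val = line.partition(":")
            key = key.strip().lower()
            if sep and key in names:
                values[key] = val.strip()
    return values


def validate(doc_type, values):
    """Return a confidence per field using a cross-page consistency rule."""
    conf = {name: 1.0 for name in values}
    if doc_type == "invoice":
        try:
            expected = float(values["subtotal"]) + float(values["tax"])
            consistent = abs(expected - float(values["total"])) < 0.005
        except (KeyError, ValueError):
            consistent = False
        if not consistent:
            # The fields depend on one another, so all of them become suspect.
            for name in ("subtotal", "tax", "total"):
                if name in conf:
                    conf[name] = 0.3
    return conf


def capture(pages, threshold=0.8):
    """Run the pipeline; return the form and the values needing user review."""
    doc_type = determine_type(pages)
    form = FormInstance(doc_type, extract(pages, FORMS_LIBRARY[doc_type]))
    conf = validate(doc_type, form.fields)
    flagged = {n: v for n, v in form.fields.items() if conf[n] < threshold}
    return form, flagged
```

For example, calling capture on a two-page invoice whose second-page total does not match the first-page subtotal plus tax returns all three dependent fields for user review, while a consistent document returns an empty set of flagged values.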