CPC G06F 40/226 (2020.01) [G06F 40/174 (2020.01); G06V 30/413 (2022.01); G06V 30/416 (2022.01); G06V 30/10 (2022.01)] | 21 Claims |
1. A method of capturing document data, comprising:
obtaining a multi-page document;
determining a document type based at least in part on the multi-page document;
creating an instance of a selected one of a plurality of type-specific data entry forms in a forms library based at least in part on the document type, wherein the instance of the data entry form is associated with the multi-page document;
populating the instance of the data entry form associated with the multi-page document based at least in part on the data associated with the multi-page document;
identifying, according to one or more validation rules, one or more fields of the instance of the data entry form for which validation of the corresponding data is required based at least in part on data extracted from a plurality of pages associated with the multi-page document, wherein data extracted from a first page of the plurality of pages is dependent on data extracted from a second page of the plurality of pages
performing automatic validation on the data extracted from the plurality of pages and associated with the one or more fields of the instance of the data entry form to generate a degree of confidence for values of the extracted data associated with the one or more fields;
identifying a subset of the values of the extracted data where a degree of confidence generated by the automatic validation is below a threshold, where each of the subset of values corresponds to a field of the one or more fields; and
providing, to the user, the identified subset of values and their corresponding fields of the instance of the data entry form for which validation of the corresponding values is required.
|