| CPC G06F 40/169 (2020.01) [G06F 16/93 (2019.01); G06Q 30/018 (2013.01); G06V 30/416 (2022.01)] | 19 Claims |

|
1. A method, comprising:
receiving, by a computer system, a document from a requestor application;
extracting layout information and text from the document;
extracting, based on the layout information, values of one or more data items from the text of the document;
producing a document validation result by analyzing the one or more data items and performing metadata validation by applying a metadata validation rule specifying an expected result of comparing a first metadata item associated with the document and the one or more data items extracted from the document;
embedding, into the document, one or more human-readable comments reflecting the document validation result;
associating, with the document, a second metadata item reflecting the document validation result;
forwarding, to the requestor application, the document comprising the one or more human readable comments; and
generating a data structure comprising one or more records, wherein a first record of the one or more records comprises a portion of the text and a layout tag associated with the portion of the text.
|