US 12,243,340 B2
System and method for domain aware document classification and information extraction from consumer documents
Jatin Agrawal, San Francisco, CA (US); Ashwin Kannan, San Ramon, CA (US); Harshil Prajapati, San Francisco, CA (US); Bharath Rengarajan, Foster City, CA (US); Eric Harvey, San Leandro, CA (US); and Emma Wei, Cupertino, CA (US)
Assigned to Informed, Inc., Belvedere Tiburon, CA (US)
Filed by Informed, Inc., San Francisco, CA (US)
Filed on Feb. 28, 2024, as Appl. No. 18/590,472.
Application 18/590,472 is a continuation of application No. 17/134,780, filed on Dec. 28, 2020, granted, now Pat. No. 11,928,878.
Claims priority of provisional application 63/123,732, filed on Dec. 10, 2020.
Claims priority of provisional application 63/070,678, filed on Aug. 26, 2020.
Prior Publication US 2024/0203149 A1, Jun. 20, 2024
This patent is subject to a terminal disclaimer.
Int. Cl. G06F 17/00 (2019.01); G06F 40/226 (2020.01); G06F 40/289 (2020.01); G06F 40/30 (2020.01); G06V 30/412 (2022.01); G06V 30/414 (2022.01); G06V 30/416 (2022.01); G06V 30/10 (2022.01)
CPC G06V 30/416 (2022.01) [G06F 40/226 (2020.01); G06F 40/289 (2020.01); G06F 40/30 (2020.01); G06V 30/412 (2022.01); G06V 30/414 (2022.01); G06V 30/10 (2022.01)]
20 Claims
 
1. A computer-implemented method comprising:
establishing, by use of a data processor and a data network, a data connection with at least one applicant platform;
receiving an upload of documents from the applicant platform via the data network;
classifying each document as being of a particular document type;
determining an information extraction strategy based on a document type classification of a particular document, the information extraction strategy including performing extractions from the document based on the document type classification and a document context, the document context including an inference of document content drawn from document content extracted from a plurality of related documents, the document context further including applying domain-specific rules of a domain-aware business layer to bias document content extraction toward compliance with the domain-specific rules; and
extracting information from the particular document based on the information extraction strategy and the document context.
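
The classification, strategy-selection, and extraction steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. Every name in it (Document, classify, build_context, extract_paystub, STRATEGIES, process_upload), the keyword classifier, the two document types, and the rule that gross pay should not be less than net pay are hypothetical and chosen only for illustration. The data connection and upload steps are not modeled; the sketch starts from documents already received.

```python
import re
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class Document:
    name: str
    text: str
    doc_type: str = "unknown"


def classify(doc: Document) -> str:
    """Toy keyword classifier (assumption) standing in for the classifying step."""
    lowered = doc.text.lower()
    if "gross pay" in lowered:
        return "paystub"
    if "credit application" in lowered:
        return "credit_application"
    return "unknown"


def build_context(docs: List[Document]) -> Dict[str, str]:
    """Infer context from related documents: here, the applicant name
    found on a credit application in the same upload."""
    context: Dict[str, str] = {}
    for doc in docs:
        if doc.doc_type == "credit_application":
            match = re.search(r"Applicant:\s*(.+)", doc.text)
            if match:
                context["expected_name"] = match.group(1).strip()
    return context


def extract_paystub(doc: Document, context: Dict[str, str]) -> Dict[str, Optional[object]]:
    """Hypothetical paystub strategy that uses context and one domain rule."""
    # Several lines may look like a name; prefer the one matching the context.
    candidates = [c.strip() for c in re.findall(r"(?:Employee|Name):\s*(.+)", doc.text)]
    expected = context.get("expected_name")
    fallback = candidates[0] if candidates else None
    name = next((c for c in candidates if c == expected), fallback)

    amounts = {
        label.lower(): float(value.replace(",", ""))
        for label, value in re.findall(r"(Gross|Net) Pay:\s*\$?([\d,]+\.?\d*)", doc.text)
    }
    gross, net = amounts.get("gross"), amounts.get("net")
    # Hypothetical domain rule: gross pay should not be less than net pay.
    # If the labels appear swapped, bias the result toward the rule.
    if gross is not None and net is not None and gross < net:
        gross, net = net, gross
    return {"employee_name": name, "gross_pay": gross, "net_pay": net}


# Extraction strategy is selected by document type.
STRATEGIES: Dict[str, Callable[[Document, Dict[str, str]], dict]] = {
    "paystub": extract_paystub,
}


def process_upload(docs: List[Document]) -> Dict[str, dict]:
    for doc in docs:
        doc.doc_type = classify(doc)
    context = build_context(docs)
    results: Dict[str, dict] = {}
    for doc in docs:
        strategy = STRATEGIES.get(doc.doc_type)
        if strategy is not None:
            results[doc.name] = strategy(doc, context)
    return results
```

In this sketch, a paystub uploaded together with a credit application naming the applicant has its employee name resolved against that applicant name rather than against the first name-like line. That is one simple reading of "an inference of document content drawn from document content extracted from a plurality of related documents."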