US 12,001,951 B2
Automated contextual processing of unstructured data
Kavita V V Ganeshan, Mumbai (IN); Swati Tata, Bangalore (IN); Soujanya Soni, Bangalore (IN); Madhur Bhasini Chaini, Bangalore (IN); Anjani Kumari, Jharkhand (IN); Omar Razi, Bangalore (IN); Thyagarajan Delli, Bangalore (IN); Ullas Balan Nambiar, Bangalore (IN); Guanglei Xiong, Pleasanton, CA (US); Sivasubramanian Arumugam Jalajam, Chennai (IN); Srinivasan Krishnan Rajagopalan, Chennai (IN); Venkatesan Kamalakannan, Chennai (IN); and Harbhajan Singh, Chennai (IN)
Assigned to ACCENTURE GLOBAL SOLUTIONS LIMITED, Dublin (IE)
Filed by ACCENTURE GLOBAL SOLUTIONS LIMITED, Dublin (IE)
Filed on Mar. 23, 2021, as Appl. No. 17/210,153.
Prior Publication US 2022/0309332 A1, Sep. 29, 2022
Int. Cl. G06N 3/08 (2023.01); G06F 16/35 (2019.01); G06F 18/22 (2023.01); G06F 40/247 (2020.01); G06N 3/04 (2023.01); G06V 30/18 (2022.01); G06V 30/262 (2022.01); G06V 30/40 (2022.01)
CPC G06N 3/08 (2013.01) [G06F 16/353 (2019.01); G06F 18/22 (2023.01); G06F 40/247 (2020.01); G06N 3/04 (2013.01); G06V 30/18057 (2022.01); G06V 30/262 (2022.01); G06V 30/40 (2022.01)]
18 Claims
 
1. A system for automated contextual processing, the system comprising:
a processor;
a data trainer coupled to the processor, the data trainer to:
classify, using a classification model, a plurality of extracted parameters from a set of digitized training documents, wherein the classification is performed to assign a document similarity score with respect to a set of reference documents corresponding to a plurality of domains;
detect automatically, a domain for the set of digitized training documents based on the document similarity score; and
load a domain based neural model for the detected domain to generate a plurality of pre-defined contextual parameters specific to the detected domain, the plurality of pre-defined contextual parameters being obtained by extraction of multiple queries from the set of digitized training documents and subsequent processing of the extracted queries;
a contextual processing engine of the processor, the engine to:
receive a set of input documents obtained by digitization of non-digital documents;
perform, through an AI model, a contextual processing of the received set of input documents based on the pre-defined contextual parameters to obtain an output in form of a plurality of filtered snippets each bearing a corresponding rank, the contextual processing comprising context building, context search and context-based ranking of one or more snippets extracted from the input documents; and
wherein a context-based verification of the unstructured data is performed based on the plurality of filtered snippets and the corresponding rank; and
a hybrid ensemble coupled to the processor, wherein the hybrid ensemble is configured to:
receive, from the contextual processing engine, a first data comprising the plurality of filtered snippets;
receive, from a learning engine coupled to the processor, a second data comprising an updated plurality of filtered snippets;
classify, using one or more models, the first data and the second data in a pre-defined format;
assign, using the one or more models, a pre-defined weight to each of the classified first data and the classified second data;
determine, using the one or more models, a similarity score based on the assigned weights; and
update, using the one or more models, the rank of each snippet in the plurality of filtered snippets to assign an updated rank.
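
Illustrative sketch (not part of the claim). The claim does not name the classification model, the similarity measure, or the weights, so everything below is an assumption made for illustration only: bag-of-words cosine similarity stands in for the document similarity score, and a fixed weighted sum stands in for the hybrid ensemble's weighting and similarity scoring. The function names detect_domain and rerank, the weights 0.6 and 0.4, and the sample data are all hypothetical.

```python
from collections import Counter
from math import sqrt


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words term-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def detect_domain(document: str, references: dict[str, str]) -> tuple[str, float]:
    """Score a document against one reference text per domain and
    return the best-matching domain with its similarity score."""
    doc_vec = Counter(document.lower().split())
    scores = {
        domain: cosine_similarity(doc_vec, Counter(text.lower().split()))
        for domain, text in references.items()
    }
    best = max(scores, key=scores.get)
    return best, scores[best]


def rerank(first: dict[str, float], second: dict[str, float],
           w_first: float = 0.6, w_second: float = 0.4) -> list[tuple[int, str, float]]:
    """Combine two sets of snippet scores with fixed (assumed) weights
    and return (updated_rank, snippet, combined_score) tuples."""
    combined = {
        snippet: w_first * first.get(snippet, 0.0) + w_second * second.get(snippet, 0.0)
        for snippet in first.keys() | second.keys()
    }
    # Highest combined score first; ties are broken by snippet text.
    ordered = sorted(combined.items(), key=lambda kv: (-kv[1], kv[0]))
    return [(rank, snippet, score)
            for rank, (snippet, score) in enumerate(ordered, start=1)]


if __name__ == "__main__":
    # Hypothetical reference documents, one per domain.
    references = {
        "insurance": "policy premium claim insured coverage",
        "banking": "loan account interest deposit branch",
    }
    print(detect_domain("the insured filed a claim under the policy", references))

    # Hypothetical scores: first data from the contextual processing engine,
    # second data (updated snippets) from the learning engine.
    first_data = {"s1": 0.9, "s2": 0.5}
    second_data = {"s2": 1.0, "s3": 0.8}
    for rank, snippet, score in rerank(first_data, second_data):
        print(rank, snippet, round(score, 3))
```

In this sketch a snippet that appears in only one of the two inputs is scored as 0.0 in the other, so snippets confirmed by both the contextual processing engine and the learning engine tend to move up in rank. A real system could substitute learned models for either step.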