| CPC G06V 30/19173 (2022.01) [G06V 10/82 (2022.01); G06V 30/1448 (2022.01); G06V 30/148 (2022.01); G06V 30/1918 (2022.01); G06V 30/41 (2022.01)] | 10 Claims |

|
1. A method of classifying a document, comprising:
receiving, from a storage device, an image of the document;
obtaining a predefined user configuration indicative of a term of interest for the document;
converting, by a document importer, the image to machine-readable data using Optical Character Recognition (OCR);
performing, by a first convolutional neural network, semantic enrichment by highlighting the term of interest in the image based on the machine-readable data;
splitting, by a second convolutional neural network, the image into four quadrants for identifying a positional context of the term of interest in the quadrants, wherein the first convolutional neural network and the second convolutional neural network are a ResNet-152 model;
generating a model representation for each of the quadrants;
concatenating the model representations of the quadrants; and
classifying the image based on the concatenated model representations.
|