| CPC G06F 3/0482 (2013.01) [G06F 3/0484 (2013.01); G06F 16/93 (2019.01); G06F 40/106 (2020.01); G06F 40/30 (2020.01); G06N 20/00 (2019.01)] | 20 Claims |

|
1. A method for categorizing electronic documents, the method comprising:
receiving, by a processor, a plurality of electronic documents;
associating, by a plurality of trained machine learning models comprising a paragraph model trained to identify one or more categories associated with paragraphs of text and a sentence model trained to identify one or more subcategories of the one or more categories associated with sentences of text, a category and a subcategory for each of the plurality of electronic documents, the one or more categories corresponding to conceptual context of a content of the text;
identifying, by the processor, a conflict between a category and a subcategory associated with a first document of the plurality of electronic documents and a category and a subcategory associated with a second document of the plurality of documents;
removing, based on the identified conflict, an association of the category and the subcategory from the first document of the plurality of electronic documents; and
generating a graphical user interface comprising a navigable document image of the first document and the second document and a list of the associated category and the subcategory within the image of the first document and the second document.
|
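
The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation, and every name in it is hypothetical. The claim does not disclose the model internals, so `paragraph_model` and `sentence_model` are keyword rules standing in for the trained machine learning models. The claim also does not define what counts as a conflict; the sketch assumes two documents conflict when they share a category but carry different subcategories. The graphical user interface is reduced to a plain-text listing.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Document:
    doc_id: str
    text: str
    category: Optional[str] = None
    subcategory: Optional[str] = None


def paragraph_model(paragraph: str) -> str:
    """Hypothetical stand-in for the trained paragraph model (keyword rule, not ML)."""
    return "payment" if "pay" in paragraph.lower() else "general"


def sentence_model(sentence: str, category: str) -> str:
    """Hypothetical stand-in for the trained sentence model (keyword rule, not ML)."""
    low = sentence.lower()
    if category == "payment":
        if "30 days" in low:
            return "net-30"
        if "60 days" in low:
            return "net-60"
    return "unspecified"


def associate(doc: Document) -> None:
    """Associate a category and a subcategory with the document."""
    doc.category = paragraph_model(doc.text)
    doc.subcategory = sentence_model(doc.text, doc.category)


def in_conflict(first: Document, second: Document) -> bool:
    """Assumed conflict rule: same category, different subcategory."""
    return (
        first.category is not None
        and first.category == second.category
        and first.subcategory != second.subcategory
    )


def resolve(first: Document, second: Document) -> bool:
    """Remove the association from the first document if a conflict is found."""
    if in_conflict(first, second):
        first.category = None
        first.subcategory = None
        return True
    return False


def render_listing(docs: List[Document]) -> List[str]:
    """Plain-text stand-in for the GUI list of categories and subcategories."""
    lines = []
    for doc in docs:
        if doc.category is None:
            lines.append(f"{doc.doc_id}: (no association)")
        else:
            lines.append(f"{doc.doc_id}: {doc.category} / {doc.subcategory}")
    return lines


if __name__ == "__main__":
    first = Document("A", "Buyer shall pay within 30 days.")
    second = Document("B", "Buyer shall pay within 60 days.")
    for d in (first, second):
        associate(d)
    resolve(first, second)
    print("\n".join(render_listing([first, second])))
```

In this sketch the two sample documents both fall under the assumed "payment" category but receive different subcategories, so the association is removed from the first document and kept on the second, mirroring the removing step of the claim.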