US 11,734,782 B2
	Automated document analysis for varying natural languages
William Michael Edmund, Lynnwood, WA (US); John E. Bradley, III, Duvall, WA (US); and Daniel Crouse, Seattle, WA (US)
Assigned to AON RISK SERVICES, INC. OF MARYLAND, New York, NY (US)
Filed by AON RISK SERVICES, INC. OF MARYLAND, New York, NY (US)
Filed on Feb. 23, 2022, as Appl. No. 17/678,703.
Application 17/678,703 is a continuation of application No. 16/523,562, filed on Jul. 26, 2019, granted, now 11,263,714.
Application 16/523,562 is a continuation of application No. 15/451,138, filed on Mar. 6, 2017, granted, now 10,366,461, issued on Jul. 30, 2019.
Prior Publication US 2022/0343445 A1, Oct. 27, 2022
This patent is subject to a terminal disclaimer.
Int. Cl. G06Q 50/18 (2012.01); G06F 40/137 (2020.01); G06F 40/247 (2020.01); G06F 40/253 (2020.01); G06F 40/263 (2020.01); G06F 40/284 (2020.01)

CPC G06Q 50/184 (2013.01) [G06F 40/137 (2020.01); G06F 40/247 (2020.01); G06F 40/253 (2020.01); G06F 40/263 (2020.01); G06F 40/284 (2020.01)]

20 Claims

1. A computer-implemented method comprising:

receiving documents containing text written in a type of natural language, individual ones of the documents associated with a generated document identification number;

generating one or more document portions for the individual ones of the documents;

generating a word count for individual ones of the document portions;

identifying a referential word count;

generating a word count ratio for individual ones of the document portions based at least in part on the referential word count and the word count for individual ones of the document portions;

determining a word frequency for the individual ones of the words included in the document portions;

generating a commonness score for the individual ones of the document portions based at least in part on the word frequency for the individual ones of the words in the document portions;

identifying a document portion of the document portions having a commonness score representing a highest commonness score of the individual ones of the document portions;

generating a commonness score ratio for the individual ones of the document portions by dividing the commonness score representing the highest commonness score by the commonness score for the individual ones of the document portions;

generating an overall score for the individual ones of the document portions based at least in part on the word count ratio and the commonness score ratio for the individual ones of the document portions; and

generating a user interface including at least one overall score for one of the document portions in proximity to the generated document identification number associated with the one of the document portions.