US 11,720,961 B2
Validation method and system to improve data accuracy
Ari Gross, West Hempstead, NY (US); Matthew Joshua Khan Persad, Brooklyn, NY (US); Yunhao Shi, New Hyde Park, NY (US); Perry Kangoun, New York, NY (US); and Talya Klein, Flushing, NY (US)
Assigned to SOFTWORKS AI, LLC, Forest Hills, NY (US)
Filed by SoftWorks AI, LLC, Forest Hills, NY (US)
Filed on Aug. 30, 2021, as Appl. No. 17/460,640.
Claims priority of provisional application 63/072,360, filed on Aug. 31, 2020.
Prior Publication US 2022/0067828 A1, Mar. 3, 2022
Int. Cl. G06Q 40/03 (2023.01); G06F 16/93 (2019.01); G06V 30/40 (2022.01); G06V 30/148 (2022.01)
CPC G06Q 40/03 (2023.01) [G06F 16/93 (2019.01); G06V 30/153 (2022.01); G06V 30/40 (2022.01)] 10 Claims
OG exemplary drawing
 
1. An automated document-analysis method comprising the steps of:
automatically selecting a data field in an electronic document;
automatically identifying each occurrence of the selected data field in the electronic document;
for each occurrence of the selected data field in the electronic document automatically obtaining a data value and a confidence value;
grouping the identified occurrences of the selected data field into a set of data-field groups based on their respective data values;
for each data-field group in the set of data-field groups, determining a number of occurrences of the selected data field and the data-field group's average confidence value;
applying a first set of criteria to each data-field group in the set of data-field groups, the first set of criteria comprising at least one of
(i) a number of group-members, and
(ii) a confidence value threshold;
based on a result of the applying step, designating one data-field group in the set of data-field groups as a global-value group, increasing the average confidence value of the global-value group;
for an at least one occurrence of the data field in the global-value group, increase the data field's confidence value;
obtaining a set of modification criteria;
applying a set of modification criteria to each occurrence of a data field in a first data-field group, other than global-value group, in the set of data-field groups; and
depending on a result of applying the set of modification criteria, changing an at least one of the data value and confidence value of an at least one data field in the first data-field group to a corresponding value of a data field in the global-value group.