US 12,266,204 B2
Information processing apparatus, image forming apparatus, and information processing method for automatically ordering page
Hidenori Shoji, Concord, CA (US)
Assigned to KYOCERA Document Solutions Inc., Osaka (JP)
Filed by KYOCERA Document Solutions Inc., Osaka (JP)
Filed on Jun. 27, 2022, as Appl. No. 17/851,020.
Prior Publication US 2023/0419713 A1, Dec. 28, 2023
Int. Cl. G06V 30/416 (2022.01); G06V 30/244 (2022.01); G06V 30/413 (2022.01)
CPC G06V 30/416 (2022.01) [G06V 30/245 (2022.01); G06V 30/413 (2022.01)] 12 Claims
OG exemplary drawing
 
1. An information processing apparatus for ordering a plurality of page data that is scanned, comprising:
an OCR unit that performs optical character recognition for character and layout in a page for each of the plurality of page data;
a rule order unit configured to classify each of the plurality of page data based on a page ordering rule according to the characters and the layout that are recognized by optical character recognition by the OCR unit, extract a page number, and calculate certainty of the page number; and
an ML order unit configured to classify page data of a page with low certainty calculated by the rule order unit by machine learning and infer the page number;
wherein
the rule order unit is configured to calculate the certainty by similarity of the layout, similarity of font, and the similarity of extraction result of the page number.