| CPC G06V 30/416 (2022.01) [G06V 30/245 (2022.01); G06V 30/413 (2022.01)] | 12 Claims | 

| 
               1. An information processing apparatus for ordering a plurality of page data that is scanned, comprising: 
            an OCR unit that performs optical character recognition for character and layout in a page for each of the plurality of page data; 
                a rule order unit configured to classify each of the plurality of page data based on a page ordering rule according to the characters and the layout that are recognized by optical character recognition by the OCR unit, extract a page number, and calculate certainty of the page number; and 
                an ML order unit configured to classify page data of a page with low certainty calculated by the rule order unit by machine learning and infer the page number; 
                wherein 
                the rule order unit is configured to calculate the certainty by similarity of the layout, similarity of font, and the similarity of extraction result of the page number. 
               |