| CPC H04N 1/00641 (2013.01) [G06V 30/416 (2022.01); H04N 1/04 (2013.01); G06V 30/10 (2022.01)] | 8 Claims |

|
1. An information processing apparatus comprising:
a processor configured to:
acquire a plurality of scanned images obtained by scanning a bundle of paper media including a plurality of standard document bundles each of which is a set of a standard document and a related document related to the standard document;
extract a title from the plurality of scanned images;
extract, from the plurality of scanned images, an identifier that is assigned to be common within one standard document bundle and to be different between different standard document bundles;
only perform a comparison on identifiers extracted from the plurality of scanned images including the title;
divide the plurality of scanned images according to a comparison result of the identifiers extracted from the plurality of scanned images including the title; and
divide the plurality of scanned images into bundles such that a scanned image from which the title has been extracted is set as a bundle head, scanned images having the same identifier are included in the same bundle, and scanned images having different identifiers are included in different bundles.
|