CPC G06V 30/22 (2022.01) [G06T 7/11 (2017.01); G06T 7/70 (2017.01); G06V 30/153 (2022.01); G06V 30/1801 (2022.01); G06V 30/222 (2022.01); G06V 30/32 (2022.01); G06T 2207/30176 (2013.01); G06V 30/1908 (2022.01)] | 18 Claims |
1. An information processing apparatus comprising:
one or more memories that store a program; and
one or more processors that execute the program to automatically perform:
separating a first image area including handwritten characters from a document image obtained by scanning a document;
extracting one or more character blocks each of which consists of some of the handwritten characters in proximity to one another and having a common baseline from the separated first image area;
generating, in a case where a plurality of character blocks is extracted from the first image area in the extracting, a combined single character block by combining character blocks based on a position relationship of the plurality of character blocks; and
performing optical character recognition processing for the generated combined single character block to obtain a character recognition result of the generated combined single character block.
|