US 12,412,409 B2
Information processing apparatus, information processing method, and storage medium
Ken Achiwa, Kanagawa (JP)
Assigned to CANON KABUSHIKI KAISHA, Tokyo (JP)
Filed by CANON KABUSHIKI KAISHA, Tokyo (JP)
Filed on Sep. 8, 2022, as Appl. No. 17/940,127.
Claims priority of application No. 2021-148491 (JP), filed on Sep. 13, 2021.
Prior Publication US 2023/0077608 A1, Mar. 16, 2023
Int. Cl. G06V 30/413 (2022.01); G06V 10/70 (2022.01); G06V 30/18 (2022.01); G06V 30/19 (2022.01)

CPC G06V 30/18019 (2022.01) [G06V 10/70 (2022.01); G06V 30/19093 (2022.01); G06V 30/413 (2022.01)]

17 Claims

1. An information processing apparatus that extracts one or more candidate character strings to be a candidate of an item value corresponding to a predetermined item, from among a plurality of character strings included in a document image to be processed, the information processing apparatus comprising:

one or more hardware processors; and

one or more memories storing one or more programs configured to be executed by the one or more hardware processors, the one or more programs including instructions for:

obtaining information on an attribute group of a preset extraction target among a predetermined plurality of attribute groups;

identifying the attribute group to which each of the plurality of character strings belongs from among the plurality of attribute groups based on a feature vector corresponding to each of the plurality of character strings, and extracting the character string whose identified attribute group is the same as the attribute group of the extraction target from among the plurality of character strings as the candidate character string; and

outputting the extracted candidate character string,

wherein information indicating a nearby group is obtained, the nearby group being the attribute group located near the attribute group of the extraction target,

wherein the character string whose identified attribute group is the same as the nearby group is also extracted from among the plurality of character strings as the candidate character string in addition to the character string whose identified attribute group is the same as the attribute group of the extraction target, and

wherein a barycentric position of each of the plurality of attribute groups is calculated and the attribute group that has a barycentric position within a range of a predetermined threshold from a barycentric position of the attribute group of the extraction target is set as the nearby group, from among the plurality of attribute groups.
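
The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation. The claim does not state how feature vectors are produced, how a character string is assigned to an attribute group, or which distance measure defines "near". The sketch assumes two-dimensional feature vectors that are already available, assignment to the group with the closest barycenter, and Euclidean distance. All function names, group names, sample strings, and the threshold value are hypothetical.

```python
from collections import defaultdict
from math import dist  # Euclidean distance, Python 3.8+


def barycenters(vectors, labels):
    """Barycentric position (mean feature vector) of each attribute group."""
    grouped = defaultdict(list)
    for vec, label in zip(vectors, labels):
        grouped[label].append(vec)
    return {
        label: tuple(sum(axis) / len(members) for axis in zip(*members))
        for label, members in grouped.items()
    }


def nearby_groups(centers, target, threshold):
    """Groups whose barycenter lies within `threshold` of the target group's."""
    return {
        label
        for label, center in centers.items()
        if label != target and dist(center, centers[target]) <= threshold
    }


def identify_group(vector, centers):
    """Assumed rule: assign a string to the group with the closest barycenter."""
    return min(centers, key=lambda label: dist(vector, centers[label]))


def extract_candidates(doc_strings, centers, target, threshold):
    """Keep strings whose group is the extraction target or a nearby group."""
    wanted = {target} | nearby_groups(centers, target, threshold)
    return [
        text
        for text, vector in doc_strings
        if identify_group(vector, centers) in wanted
    ]


# Hypothetical feature vectors with known attribute groups.
sample_vectors = [(0.0, 0.0), (2.0, 0.0), (1.0, 3.0), (3.0, 3.0),
                  (10.0, 10.0), (12.0, 10.0)]
sample_labels = ["amount", "amount", "price", "price", "date", "date"]
centers = barycenters(sample_vectors, sample_labels)

# Hypothetical character strings from a document image, with feature vectors.
doc_strings = [
    ("1,200", (1.2, 0.1)),
    ("980", (2.1, 2.8)),
    ("2021-09-13", (10.5, 9.9)),
    ("1,500", (0.8, -0.2)),
]

candidates = extract_candidates(doc_strings, centers, "amount", threshold=4.0)
print(candidates)  # ['1,200', '980', '1,500']
```

With these made-up values, the "price" group's barycenter is about 3.16 from that of "amount", inside the threshold of 4.0, so "980" is output as a candidate even though its identified group is not the extraction target. The "date" group is about 14.1 away and is excluded.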