US 11,755,659 B2
Document search device, document search program, and document search method
Yoshishige Okuno, Tokyo (JP); Takuya Minami, Tokyo (JP); Eriko Takeda, Tokyo (JP); and Hajime Hotta, Tokyo (JP)
Assigned to Resonac Corporation, Tokyo (JP)
Appl. No. 17/282,036
Filed by Resonac Corporation, Tokyo (JP)
PCT Filed Sep. 26, 2019, PCT No. PCT/JP2019/038016
§ 371(c)(1), (2) Date Apr. 1, 2021,
PCT Pub. No. WO2020/071252, PCT Pub. Date Apr. 9, 2020.
Claims priority of application No. 2018-189438 (JP), filed on Oct. 4, 2018.
Prior Publication US 2021/0374189 A1, Dec. 2, 2021
Int. Cl. G06F 16/903 (2019.01); G06F 16/907 (2019.01); G06F 16/93 (2019.01); G06F 16/58 (2019.01)
CPC G06F 16/90344 (2019.01) [G06F 16/5866 (2019.01); G06F 16/907 (2019.01); G06F 16/93 (2019.01)] 12 Claims
OG exemplary drawing
 
1. A document search device comprising:
a processor; and
a memory storing program instructions that cause the processor to
search for an input keyword in a document database in which document information including text data is stored, the text data being extracted by using a character recognition process from document image data generated by imaging a paper document;
select a similar keyword in accordance with a degree of similarity to the input keyword from a group of wildcard strings generated from the input keyword and search for the similar keyword in the document database, the degree of similarity being determined by comparing each character of the input keyword with a corresponding character of a wildcard string in the group of wildcard strings; and
output a search result obtained by searching for the input keyword in the document database and a search result obtained by searching for the similar keyword in the document database,
wherein the program instructions cause the processor to further extract, from the document database, a non-matched document information group that is a document information group other than the search result obtained by searching for the input keyword, and
search for the group of the wildcard strings in the non-matched document information group to obtain a group of wildcard strings that exist in the non-matched document information group,
wherein the processor selects the similar keyword in accordance with the degree of similarity to the input keyword from the group of the wildcard strings that exist in the non-matched document information group.