CPC G06V 10/247 (2022.01) [G06K 7/1413 (2013.01); G06N 3/02 (2013.01); G06T 3/60 (2013.01); G06T 7/13 (2017.01); G06V 10/44 (2022.01); G06V 10/454 (2022.01); G06V 10/82 (2022.01); G06V 10/95 (2022.01); G06V 30/1478 (2022.01); G06V 30/413 (2022.01); G06T 2207/20084 (2013.01); G06T 2207/30176 (2013.01)] | 14 Claims |
1. A method for recognizing an imaged information-bearing medium, including:
acquiring a first image of the imaged information-bearing medium comprising:
performing target detection and correction on the imaged information-bearing medium in an original image based on the acquired original image to acquire the first image;
performing text recognition on the first image to acquire a text content of the imaged information-bearing medium;
classifying the imaged information-bearing medium to acquire a type of the imaged information-bearing medium; and
archiving the text content according to the type,
wherein performing text recognition on the first image to acquire a text content of the imaged information-bearing medium includes:
performing text detection on the first image to acquire multiple second images; and
recognizing the multiple second images with a preset text recognition network model to acquire the text content of the imaged information-bearing medium;
wherein performing target detection and correction on the imaged information-bearing medium in an original image based on the acquired original image to acquire the first image comprises:
performing image binarization based on the acquired original image;
performing edge detection to acquire an outline of the largest rectangle in the original image, or performing straight line detection to acquire groups of a horizontal straight line set and a vertical straight line set, and merging approximate parallel lines to determine an optimal boundary and vertices of the imaged information-bearing medium;
segmenting the first image from the original image by perspective transformation.
|