US 11,893,765 B2
Method and apparatus for recognizing imaged information-bearing medium, computer device and medium
Guangwei Huang, Beijing (CN); Ruibin Xue, Beijing (CN); Bingchuan Shi, Beijing (CN); Yue Li, Beijing (CN); and Jibo Zhao, Beijing (CN)
Assigned to BOE TECHNOLOGY GROUP CO., LTD., Beijing (CN)
Appl. No. 17/279,684
Filed by BOE TECHNOLOGY GROUP CO., LTD., Beijing (CN)
PCT Filed May 20, 2020, PCT No. PCT/CN2020/091368
§ 371(c)(1), (2) Date Mar. 25, 2021,
PCT Pub. No. WO2020/233611, PCT Pub. Date Nov. 26, 2020.
Claims priority of application No. 201910417247.6 (CN), filed on May 20, 2019.
Prior Publication US 2022/0036115 A1, Feb. 3, 2022
Int. Cl. G06V 10/24 (2022.01); G06T 7/13 (2017.01); G06V 30/413 (2022.01); G06V 30/146 (2022.01); G06V 10/94 (2022.01); G06V 10/44 (2022.01); G06K 7/14 (2006.01); G06N 3/02 (2006.01); G06T 3/60 (2006.01); G06V 10/82 (2022.01)
CPC G06V 10/247 (2022.01) [G06K 7/1413 (2013.01); G06N 3/02 (2013.01); G06T 3/60 (2013.01); G06T 7/13 (2017.01); G06V 10/44 (2022.01); G06V 10/454 (2022.01); G06V 10/82 (2022.01); G06V 10/95 (2022.01); G06V 30/1478 (2022.01); G06V 30/413 (2022.01); G06T 2207/20084 (2013.01); G06T 2207/30176 (2013.01)] 14 Claims
OG exemplary drawing
 
1. A method for recognizing an imaged information-bearing medium, including:
acquiring a first image of the imaged information-bearing medium comprising:
performing target detection and correction on the imaged information-bearing medium in an original image based on the acquired original image to acquire the first image;
performing text recognition on the first image to acquire a text content of the imaged information-bearing medium;
classifying the imaged information-bearing medium to acquire a type of the imaged information-bearing medium; and
archiving the text content according to the type,
wherein performing text recognition on the first image to acquire a text content of the imaged information-bearing medium includes:
performing text detection on the first image to acquire multiple second images; and
recognizing the multiple second images with a preset text recognition network model to acquire the text content of the imaged information-bearing medium;
wherein performing target detection and correction on the imaged information-bearing medium in an original image based on the acquired original image to acquire the first image comprises:
performing image binarization based on the acquired original image;
performing edge detection to acquire an outline of the largest rectangle in the original image, or performing straight line detection to acquire groups of a horizontal straight line set and a vertical straight line set, and merging approximate parallel lines to determine an optimal boundary and vertices of the imaged information-bearing medium;
segmenting the first image from the original image by perspective transformation.