US 11,893,765 B2
	Method and apparatus for recognizing imaged information-bearing medium, computer device and medium
Guangwei Huang, Beijing (CN); Ruibin Xue, Beijing (CN); Bingchuan Shi, Beijing (CN); Yue Li, Beijing (CN); and Jibo Zhao, Beijing (CN)
Assigned to BOE TECHNOLOGY GROUP CO., LTD., Beijing (CN)
Appl. No. 17/279,684
Filed by BOE TECHNOLOGY GROUP CO., LTD., Beijing (CN)
PCT Filed May 20, 2020, PCT No. PCT/CN2020/091368 § 371(c)(1), (2) Date Mar. 25, 2021, PCT Pub. No. WO2020/233611, PCT Pub. Date Nov. 26, 2020.
Claims priority of application No. 201910417247.6 (CN), filed on May 20, 2019.
Prior Publication US 2022/0036115 A1, Feb. 3, 2022
Int. Cl. G06V 10/24 (2022.01); G06T 7/13 (2017.01); G06V 30/413 (2022.01); G06V 30/146 (2022.01); G06V 10/94 (2022.01); G06V 10/44 (2022.01); G06K 7/14 (2006.01); G06N 3/02 (2006.01); G06T 3/60 (2006.01); G06V 10/82 (2022.01)

CPC G06V 10/247 (2022.01) [G06K 7/1413 (2013.01); G06N 3/02 (2013.01); G06T 3/60 (2013.01); G06T 7/13 (2017.01); G06V 10/44 (2022.01); G06V 10/454 (2022.01); G06V 10/82 (2022.01); G06V 10/95 (2022.01); G06V 30/1478 (2022.01); G06V 30/413 (2022.01); G06T 2207/20084 (2013.01); G06T 2207/30176 (2013.01)]

14 Claims

1. A method for recognizing an imaged information-bearing medium, including:

acquiring a first image of the imaged information-bearing medium comprising:

performing target detection and correction on the imaged information-bearing medium in an original image based on the acquired original image to acquire the first image;

performing text recognition on the first image to acquire a text content of the imaged information-bearing medium;

classifying the imaged information-bearing medium to acquire a type of the imaged information-bearing medium; and

archiving the text content according to the type,

wherein performing text recognition on the first image to acquire a text content of the imaged information-bearing medium includes:

performing text detection on the first image to acquire multiple second images; and

recognizing the multiple second images with a preset text recognition network model to acquire the text content of the imaged information-bearing medium;

wherein performing target detection and correction on the imaged information-bearing medium in an original image based on the acquired original image to acquire the first image comprises:

performing image binarization based on the acquired original image;

performing edge detection to acquire an outline of the largest rectangle in the original image, or performing straight line detection to acquire groups of a horizontal straight line set and a vertical straight line set, and merging approximate parallel lines to determine an optimal boundary and vertices of the imaged information-bearing medium;

segmenting the first image from the original image by perspective transformation.