US 11,893,767 B2
Text recognition method and apparatus
Jieming Li, Shenzhen (CN); Jianchao Huang, Shenzhen (CN); Xing Zhou, Shenzhen (CN); Yongfei Pu, Shenzhen (CN); Yuanlin Chen, Shenzhen (CN); and Lifei Zhu, Shenzhen (CN)
Assigned to HUAWEI TECHNOLOGIES CO., LTD., Shenzhen (CN)
Filed by HUAWEI TECHNOLOGIES CO., LTD., Guangdong (CN)
Filed on Jun. 10, 2022, as Appl. No. 17/837,231.
Application 17/837,231 is a continuation of application No. PCT/CN2020/130217, filed on Nov. 19, 2020.
Claims priority of application No. 201911285619.0 (CN), filed on Dec. 13, 2019.
Prior Publication US 2022/0301328 A1, Sep. 22, 2022
Int. Cl. G06V 10/26 (2022.01); G06V 30/146 (2022.01); G06V 30/148 (2022.01); G06V 10/22 (2022.01); G06V 30/19 (2022.01); G06V 20/62 (2022.01); G06V 30/168 (2022.01); G06F 18/241 (2023.01); G06V 30/10 (2022.01); G06V 30/16 (2022.01)
CPC G06V 10/267 (2022.01) [G06F 18/241 (2023.01); G06V 10/22 (2022.01); G06V 20/62 (2022.01); G06V 30/1473 (2022.01); G06V 30/15 (2022.01); G06V 30/153 (2022.01); G06V 30/158 (2022.01); G06V 30/168 (2022.01); G06V 30/19173 (2022.01); G06V 30/10 (2022.01); G06V 30/1607 (2022.01)] 21 Claims
 
1. A text recognition method, comprising:
obtaining a to-be-detected image;
determining a target text detection area in the to-be-detected image, wherein the target text detection area comprises target text in the to-be-detected image, the target text detection area is a polygonal area, the polygonal area comprises m vertex pairs, m is a positive integer greater than 2, and each of the m vertex pairs comprises two vertices including a vertex located on one side of the target text and another vertex located on another side of the target text, so that m vertices are located on one side of the target text, and other m vertices are located on another side of the target text;
correcting the polygonal area to m−1 rectangular areas to obtain a corrected target text detection area based on the m vertex pairs; and
performing text recognition on the corrected target text detection area to determine the target text, and outputting the target text,
wherein the determining of the target text detection area in the to-be-detected image comprises:
determining a plurality of candidate center points in the to-be-detected image and a vertex pair corresponding to each of the plurality of candidate center points, wherein each of the plurality of candidate center points is associated with a confidence level indicating likelihood of the associated candidate center point being a center point in the target text detection area in a text height direction; and
determining, based on the confidence level of each of the plurality of candidate center points, center points forming a center line of the target text detection area, wherein the center line passes through all pieces of text in the target text detection area.
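
The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation. The function names (select_center_points, split_into_quads, rect_size), the fixed confidence threshold, the left-to-right ordering by x coordinate, and the averaging rule for the rectangle size are all assumptions made for illustration; the claim only requires that center points be chosen based on confidence and that the polygon with m vertex pairs be corrected to m−1 rectangular areas.

```python
import math
from typing import List, Tuple

Point = Tuple[float, float]
# One vertex on each side of the text (for example, above and below it).
VertexPair = Tuple[Point, Point]
# A candidate center point, its confidence level, and its vertex pair.
Candidate = Tuple[Point, float, VertexPair]
Quad = Tuple[Point, Point, Point, Point]


def select_center_points(candidates: List[Candidate],
                         threshold: float = 0.5) -> List[Tuple[Point, VertexPair]]:
    """Keep candidates whose confidence meets the threshold.

    Assumption: a simple threshold stands in for "determining based on the
    confidence level", and sorting by x assumes roughly horizontal text.
    """
    kept = [c for c in candidates if c[1] >= threshold]
    kept.sort(key=lambda c: c[0][0])
    return [(center, pair) for center, _conf, pair in kept]


def split_into_quads(pairs: List[VertexPair]) -> List[Quad]:
    """Split a polygon described by m vertex pairs into m-1 quadrilaterals.

    Each quadrilateral is bounded by two adjacent vertex pairs.
    """
    if len(pairs) < 3:
        raise ValueError("the claim requires m > 2 vertex pairs")
    quads = []
    for (top0, bottom0), (top1, bottom1) in zip(pairs, pairs[1:]):
        quads.append((top0, top1, bottom1, bottom0))
    return quads


def rect_size(quad: Quad) -> Tuple[float, float]:
    """Width and height of the rectangle a quadrilateral is corrected to.

    Assumption: each dimension is the mean length of the two opposite edges.
    """
    top0, top1, bottom1, bottom0 = quad
    width = (math.dist(top0, top1) + math.dist(bottom0, bottom1)) / 2
    height = (math.dist(top0, bottom0) + math.dist(top1, bottom1)) / 2
    return width, height


# Hypothetical candidates: (center point, confidence, vertex pair).
candidates = [
    ((10.0, 5.0), 0.9, ((10.0, 0.0), (10.0, 10.0))),
    ((0.0, 5.0), 0.8, ((0.0, 0.0), (0.0, 10.0))),
    ((5.0, 5.0), 0.2, ((5.0, 0.0), (5.0, 10.0))),   # low confidence, dropped
    ((20.0, 10.0), 0.7, ((20.0, 5.0), (20.0, 15.0))),
]
selected = select_center_points(candidates)
pairs = [pair for _center, pair in selected]
quads = split_into_quads(pairs)          # m = 3 pairs give m - 1 = 2 areas
sizes = [rect_size(q) for q in quads]
```

In a full pipeline, each quadrilateral would then be warped onto its rectangle (for example with a perspective transform) and the m−1 rectangles joined in order before recognition; that step is omitted here to keep the sketch dependency-free.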