US 12,230,045 B2
	English char image recognition method
Chung-Hsing Chen, Taipei (TW)
Assigned to ADVANCED VIEW INC., Taipei (TW)
Filed by ADVANCED VIEW INC., Taipei (TW)
Filed on Sep. 1, 2022, as Appl. No. 17/901,538.
Claims priority of application No. 110137294 (TW), filed on Oct. 7, 2021.
Prior Publication US 2023/0146241 A1, May 11, 2023
Int. Cl. G06V 30/19 (2022.01); G06T 7/12 (2017.01); G06T 7/66 (2017.01); G06T 7/73 (2017.01); G06V 10/82 (2022.01)

CPC G06V 30/19127 (2022.01) [G06T 7/12 (2017.01); G06T 7/66 (2017.01); G06T 7/73 (2017.01); G06V 10/82 (2022.01); G06V 30/19173 (2022.01); G06T 2207/20021 (2013.01); G06T 2207/20076 (2013.01); G06T 2207/20084 (2013.01)]

9 Claims

1. An English char image recognition method applied to an electronic device with computing capabilities, the electronic device comprising a processing unit, the English char image recognition method at least comprising steps of:

loading an English character image, generating a rectangular coordinate frame for the English character image by the processing unit;

finding a gravity center of the English character image in the rectangular coordinate frame, radiating 12 straight lines in sequence from the gravity center toward a direction of the rectangular coordinate frame at an interval angle of 30°, defining a simulated distance on each of the straight lines and calculating 3 vector feature positions from the simulated distance in order to obtain 36 vector features in the rectangular coordinate frame and read a gray scale of the vector feature positions;

dividing into four equal parts by 5 dividing points respectively on upper and lower edges of the rectangular coordinate frame and defining the dividing points as positions of upper and lower edge features to obtain 5 edge features respectively, dividing into five equal parts by 6 dividing points respectively on left and right edges of the rectangular coordinate frame and defining the dividing points as positions of left and right edge features to obtain 4 edge features in order to obtain 18 edge features on the rectangular coordinate frame and read a gray scale of positions of the edge features;

merging 36 vector features and 18 edge features and arranging in a 1×54 array, adding a blank feature respectively at beginning and end of the 1×54 array, arranging and forming a feature map in a 1×56 array;

performing 6 times of one-dimensional convolutional operation to generate 6 first array feature maps with an array of 1×54;

performing a maximum pooling operation of convolutional neural network on each of the first array feature maps to generate 6 second array feature maps with an array of 1×27;

performing a one-dimensional convolution operation of convolutional neural network on each of the second array feature maps to generate 16 third array feature maps with an array of 1×25;

performing a full connection of convolutional neural network on the third array feature maps to form a fourth full connection layer/fourth array feature maps with an array of 1×300;

performing a full connection of convolutional neural network on the fourth full connection layer/fourth array feature maps to form a fifth full connection layer/fifth array feature maps with an array of 1×150;

performing a full connection of convolutional neural network on the fifth full connection layer/fifth array feature maps to form a sixth full connection layer/sixth array feature maps with an array of 1×52; and

the processing unit performing probability recognition according to the sixth full connection layer/sixth array feature maps with an array of 1×52 and outputting 52 probabilities, and among the outputted 52 probabilities, outputting a class with a probability value floating point number being closer to 1 as a character recognition result.