| CPC G06V 30/1456 (2022.01) [G06F 3/017 (2013.01); G06V 40/28 (2022.01); G09B 5/065 (2013.01); G10L 13/08 (2013.01)] | 12 Claims |

|
1. A text processing method, comprising:
collecting a to-be-processed text image, and performing gesture recognition on the to-be-processed text image to obtain a to-be-processed text, wherein the to-be-processed text is a text selected from the to-be-processed text image through a gesture;
performing voice broadcasting on the to-be-processed text to prompt a user to perform dictation processing on the to-be-processed text; and
collecting a dictation text image, performing recognition on the dictation text image, and determining a dictation check result according to a recognition result and the to-be-processed text,
wherein collecting the to-be-processed text image, and performing the gesture recognition on the to-be-processed text image to obtain the to-be-processed text comprises:
collecting, according to an entry type of the to-be-processed text, the to-be-processed text image, and determining, by performing the gesture recognition on the to-be-processed text image, the to-be-processed text selected through the gesture;
wherein collecting, according to the entry type of the to-be-processed text, the to-be-processed text image, and determining, by performing the gesture recognition on the to-be-processed text image, the to-be-processed text selected through the gesture comprises:
based on a determination result that the entry type of the to-be-processed text is character-by-character entry, collecting a text image at a fingertip point of the user as the to-be-processed text image; and
using a text above the fingertip point of the user in the to-be-processed text image as the to-be-processed text.
|