US 12,136,285 B2
Text processing method and apparatus, and electronic device and non-transitory computer-readable medium
Yujun Song, Beijing (CN)
Assigned to BEIJING BYTEDANCE NETWORK TECHNOLOGY CO., LTD., Beijing (CN)
Appl. No. 17/639,308
Filed by Beijing Bytedance Network Technology Co., Ltd., Beijing (CN)
PCT Filed Aug. 14, 2020, PCT No. PCT/CN2020/109213
§ 371(c)(1), (2) Date Feb. 28, 2022,
PCT Pub. No. WO2021/036823, PCT Pub. Date Mar. 4, 2021.
Claims priority of application No. 201910816906.3 (CN), filed on Aug. 30, 2019.
Prior Publication US 2022/0319347 A1, Oct. 6, 2022
Int. Cl. G06V 30/14 (2022.01); G06F 3/01 (2006.01); G06V 40/20 (2022.01); G09B 5/06 (2006.01); G10L 13/08 (2013.01)
CPC G06V 30/1456 (2022.01) [G06F 3/017 (2013.01); G06V 40/28 (2022.01); G09B 5/065 (2013.01); G10L 13/08 (2013.01)]
12 Claims
 
1. A text processing method, comprising:
collecting a to-be-processed text image, and performing gesture recognition on the to-be-processed text image to obtain a to-be-processed text, wherein the to-be-processed text is a text selected from the to-be-processed text image through a gesture;
performing voice broadcasting on the to-be-processed text to prompt a user to perform dictation processing on the to-be-processed text; and
collecting a dictation text image, performing recognition on the dictation text image, and determining a dictation check result according to a recognition result and the to-be-processed text,
wherein collecting the to-be-processed text image, and performing the gesture recognition on the to-be-processed text image to obtain the to-be-processed text comprises:
collecting, according to an entry type of the to-be-processed text, the to-be-processed text image, and determining, by performing the gesture recognition on the to-be-processed text image, the to-be-processed text selected through the gesture;
wherein collecting, according to the entry type of the to-be-processed text, the to-be-processed text image, and determining, by performing the gesture recognition on the to-be-processed text image, the to-be-processed text selected through the gesture comprises:
based on a determination result that the entry type of the to-be-processed text is character-by-character entry, collecting a text image at a fingertip point of the user as the to-be-processed text image; and
using a text above the fingertip point of the user in the to-be-processed text image as the to-be-processed text.
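
The character-by-character selection and the dictation check recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation: the claim does not state how text or fingertips are recognized, nor how the recognition result is compared with the to-be-processed text. The sketch assumes an upstream recognizer has already produced text boxes and a fingertip coordinate, assumes image coordinates in which y grows downward, and uses exact string equality as the check. The names `TextBox`, `text_above_fingertip`, and `check_dictation` are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class TextBox:
    """Hypothetical OCR output: a recognized text and its bounding box (pixels)."""
    text: str
    left: float
    top: float
    right: float
    bottom: float


def text_above_fingertip(boxes: List[TextBox], tip_x: float, tip_y: float) -> Optional[str]:
    """Return the text of the nearest box lying above the fingertip point.

    Assumption: y grows downward, so "above" means box.bottom <= tip_y.
    Only boxes whose horizontal extent contains tip_x are considered.
    """
    candidates = [
        b for b in boxes
        if b.left <= tip_x <= b.right and b.bottom <= tip_y
    ]
    if not candidates:
        return None
    # The nearest box above the fingertip has the largest bottom edge.
    return max(candidates, key=lambda b: b.bottom).text


def check_dictation(expected: str, recognized: str) -> dict:
    """Compare the recognized dictation with the to-be-processed text.

    Assumption: exact match after trimming whitespace; the claim leaves
    the comparison rule open.
    """
    correct = expected.strip() == recognized.strip()
    return {"expected": expected, "recognized": recognized, "correct": correct}


# Example: two words stacked vertically, fingertip just below the second one.
boxes = [
    TextBox("cat", left=10, top=10, right=50, bottom=30),
    TextBox("dog", left=10, top=40, right=50, bottom=60),
]
selected = text_above_fingertip(boxes, tip_x=30, tip_y=65)
result = check_dictation(selected, " dog ")
```

In this example the fingertip at (30, 65) lies below both boxes, so the nearer one is selected; a fingertip outside the horizontal extent of every box yields `None`, which a caller would need to handle before broadcasting.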