US 12,260,853 B2
Speech processing method and apparatus
Yinping Zhang, Beijing (CN); Chenyu Zhang, Beijing (CN); and Lili Guo, Beijing (CN)
Assigned to LENOVO (BEIJING) LIMITED, Beijing (CN)
Filed by Lenovo (Beijing) Limited, Beijing (CN)
Filed on Mar. 10, 2022, as Appl. No. 17/654,270.
Claims priority of application No. 202110645953.3 (CN), filed on Jun. 10, 2021.
Prior Publication US 2022/0399012 A1, Dec. 15, 2022
Int. Cl. G10L 15/22 (2006.01); G10L 15/05 (2013.01); G10L 15/06 (2013.01); G10L 15/16 (2006.01); G10L 15/18 (2013.01); G10L 15/08 (2006.01)
CPC G10L 15/1815 (2013.01) [G10L 15/05 (2013.01); G10L 15/063 (2013.01); G10L 15/16 (2013.01); G10L 15/22 (2013.01); G10L 2015/088 (2013.01)] 18 Claims
OG exemplary drawing
 
1. A speech processing method, comprising:
obtaining first speech information from a user, wherein a duration of the first speech information exceeds a preset analysis duration threshold;
determining one or more similar speech segments in the first speech information and deleting one or more similar frames in the one or more similar speech segments to obtain second speech information, wherein a duration of the second speech information does not exceed the preset analysis duration threshold, and deleting the one or more similar frames in the one or more similar speech segments to obtain the second speech information comprises:
determining, according to the duration of the first speech information and the preset analysis duration threshold, a deletion ratio; and
deleting, according to the deletion ratio, the one or more similar frames in the one or more similar speech segments to obtain the second speech information; and
analyzing the second speech information to determine a user intent corresponding to the first speech information.