US 11,810,546 B2
Sample generation method and apparatus
Dongxiao Wang, Beijing (CN); Mingqi Yang, Beijing (CN); Nan Ma, Beijing (CN); Long Xia, Beijing (CN); and Changzhen Guo, Beijing (CN)
Assigned to Beijing Yuanli Weilai Science and Technology Co., Ltd., Beijing (CN)
Appl. No. 18/253,717
Filed by BEIJING YUANLI WEILAI SCIENCE AND TECHNOLOGY CO., LTD., Beijing (CN)
PCT Filed Nov. 12, 2021, PCT No. PCT/CN2021/130459
§ 371(c)(1), (2) Date May 19, 2023,
PCT Pub. No. WO2022/105693, PCT Pub. Date May 27, 2022.
Claims priority of application No. 202011309190.7 (CN), filed on Nov. 20, 2020.
Prior Publication US 2023/0317052 A1, Oct. 5, 2023
Int. Cl. G10L 15/22 (2006.01); G10L 13/02 (2013.01); G10L 13/08 (2013.01)
CPC G10L 13/02 (2013.01) [G10L 13/08 (2013.01)] 16 Claims
OG exemplary drawing
 
1. A sample generation method, comprising:
acquiring a plurality of text-audio pairs, wherein each text-audio pair comprises a text segment and an audio segment;
calculating, for each text-audio pair among the plurality of text-audio pairs, an audio feature of the audio segment of the text-audio pair, and screening out from the plurality of text-audio pairs, according to the audio feature, a target text-audio pair and a splicing text-audio pair corresponding to the target text-audio pair;
splicing the target text-audio pair and the splicing text-audio pair into a to-be-detected text-audio pair, and detecting the to-be-detected text-audio pair; and
writing the to-be-detected text-audio pair into a training database in a case that the to-be-detected text-audio pair meets a preset detection condition.