US 11,984,134 B2
Method of processing audio data, electronic device and storage medium
Jiankang Hou, Beijing (CN); Zhipeng Nie, Beijing (CN); Liqiang Zhang, Beijing (CN); Tao Sun, Beijing (CN); and Lei Jia, Beijing (CN)
Assigned to BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD., Beijing (CN)
Filed by BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD., Beijing (CN)
Filed on Nov. 29, 2022, as Appl. No. 18/071,187.
Claims priority of application No. 202111454677.9 (CN), filed on Nov. 30, 2021.
Prior Publication US 2023/0087531 A1, Mar. 23, 2023
Int. Cl. G10L 25/18 (2013.01); G10L 25/30 (2013.01)
CPC G10L 25/18 (2013.01) [G10L 25/30 (2013.01)] 12 Claims
OG exemplary drawing
 
1. A method of processing audio data, the method comprising:
inputting spectral data of the audio data into a feature extraction model to obtain a first feature vector;
inputting the first feature vector into a periodic/aperiodic indicator detection model to obtain a periodic/aperiodic indicator, wherein the periodic/aperiodic indicator is a feature vector indicating periodic audio data of the first feature vector and aperiodic audio data of the first feature vector;
concatenating the first feature vector and the periodic/aperiodic indicator to obtain a second feature vector;
inputting the second feature vector into a fundamental frequency detection model to obtain a fundamental frequency;
concatenating the first feature vector, the periodic/aperiodic indicator and the fundamental frequency to obtain a third feature vector;
obtaining a spectral energy according to the third feature vector;
obtaining a harmonic structure of the audio data according to the fundamental frequency and the spectral energy;
obtaining a noise information in the audio data according to the first feature vector; and
obtaining synthetic audio data according to the harmonic structure and the noise information.