US 12,266,373 B2
Method and apparatus for audio processing, electronic device and storage medium
Mingshuang Luo, Beijing (CN); Fangjun Kuang, Beijing (CN); Liyong Guo, Beijing (CN); Long Lin, Beijing (CN); Wei Kang, Beijing (CN); Zengwei Yao, Beijing (CN); and Povey Daniel, Beijing (CN)
Assigned to BEIJING XIAOMI MOBILE SOFTWARE CO., LTD., Beijing (CN)
Filed by BEIJING XIAOMI MOBILE SOFTWARE CO., LTD., Beijing (CN)
Filed on Dec. 9, 2022, as Appl. No. 18/078,483.
Claims priority of application No. 202210616304.5 (CN), filed on May 31, 2022.
Prior Publication US 2023/0386483 A1, Nov. 30, 2023
Int. Cl. G10L 19/008 (2013.01); G10L 25/30 (2013.01)
CPC G10L 19/008 (2013.01) [G10L 25/30 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A method for processing audio, comprising:
obtaining an audio encoding result, wherein the audio encoding result comprises a plurality of elements, and each element in the audio encoding result comprises a coordinate in an audio frame number dimension and a coordinate in a text label sequence dimension;
in response to that an output result of an ith frame in a decoding path is a non-null character, respectively increasing the coordinate in the audio frame number dimension and the coordinate in the text label sequence dimension corresponding to an output position of the ith frame by 1 to obtain an output position of a (i+1)th frame in the decoding path, wherein i is an integer greater than or equal to 1; and
determining an output result corresponding to the output position of the (i+1)th frame in the decoding path according to the output result of the ith frame in the decoding path and an element of the (i+1)th frame in the audio encoding result.