CPC G10L 19/008 (2013.01) [G10L 25/30 (2013.01)] | 20 Claims |
1. A method for processing audio, comprising:
obtaining an audio encoding result, wherein the audio encoding result comprises a plurality of elements, and each element in the audio encoding result comprises a coordinate in an audio frame number dimension and a coordinate in a text label sequence dimension;
in response to that an output result of an ith frame in a decoding path is a non-null character, respectively increasing the coordinate in the audio frame number dimension and the coordinate in the text label sequence dimension corresponding to an output position of the ith frame by 1 to obtain an output position of a (i+1)th frame in the decoding path, wherein i is an integer greater than or equal to 1; and
determining an output result corresponding to the output position of the (i+1)th frame in the decoding path according to the output result of the ith frame in the decoding path and an element of the (i+1)th frame in the audio encoding result.
|