| CPC G10L 19/008 (2013.01) [G10L 19/00 (2013.01); G10L 19/012 (2013.01); H04S 3/008 (2013.01); G10L 19/24 (2013.01); G10L 25/78 (2013.01); H04S 2400/03 (2013.01)] | 20 Claims |

|
1. A terminal comprising:
an encoder comprising a first algorithm and configured to:
mix Nth-frame audio signals of two of a plurality of channels of a multi-channel audio signal based on the first algorithm to obtain an Nth-frame downmixed signal, wherein N is a positive integer greater than zero;
detect, using voice activity detection (VAD), whether the Nth-frame downmixed signal comprises a speech signal; and
encode the Nth-frame downmixed signal into a bitstream when detecting that the Nth-frame downmixed signal does not comprise the speech signal and when the Nth-frame downmixed signal satisfies a preset audio frame encoding condition; and
a transmitter coupled to the encoder and configured to transmit the bitstream.
|