US 12,315,522 B2
Multichannel audio signal processing method, apparatus, and system
Zhe Wang, Beijing (CN)
Assigned to HUAWEI TECHNOLGOIES CO., LTD., Shenzhen (CN)
Filed by Huawei Technologies Co., Ltd., Shenzhen (CN)
Filed on Jan. 23, 2024, as Appl. No. 18/420,007.
Application 18/420,007 is a continuation of application No. 17/232,679, filed on Apr. 16, 2021, granted, now 11,922,954.
Application 17/232,679 is a continuation of application No. 16/781,421, filed on Feb. 4, 2020, granted, now 10,984,807, issued on Apr. 20, 2021.
Application 16/781,421 is a continuation of application No. 16/368,208, filed on Mar. 28, 2019, granted, now 10,593,339, issued on Mar. 17, 2020.
Application 16/368,208 is a continuation of application No. PCT/CN2016/100617, filed on Sep. 28, 2016.
Prior Publication US 2024/0233736 A1, Jul. 11, 2024
This patent is subject to a terminal disclaimer.
Int. Cl. G10L 19/008 (2013.01); G10L 19/00 (2013.01); G10L 19/012 (2013.01); H04S 3/00 (2006.01); G10L 19/24 (2013.01); G10L 25/78 (2013.01)
CPC G10L 19/008 (2013.01) [G10L 19/00 (2013.01); G10L 19/012 (2013.01); H04S 3/008 (2013.01); G10L 19/24 (2013.01); G10L 25/78 (2013.01); H04S 2400/03 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A terminal comprising:
an encoder comprising a first algorithm and configured to:
mix Nth-frame audio signals of two of a plurality of channels of a multi-channel audio signal based on the first algorithm to obtain an Nth-frame downmixed signal, wherein N is a positive integer greater than zero;
detect, using voice activity detection (VAD), whether the Nth-frame downmixed signal comprises a speech signal; and
encode the Nth-frame downmixed signal into a bitstream when detecting that the Nth-frame downmixed signal does not comprise the speech signal and when the Nth-frame downmixed signal satisfies a preset audio frame encoding condition; and
a transmitter coupled to the encoder and configured to transmit the bitstream.