US 12,315,522 B2
Multichannel audio signal processing method, apparatus, and system
Zhe Wang, Beijing (CN)
Assigned to HUAWEI TECHNOLOGIES CO., LTD., Shenzhen (CN)
Filed by Huawei Technologies Co., Ltd., Shenzhen (CN)
Filed on Jan. 23, 2024, as Appl. No. 18/420,007.
Application 18/420,007 is a continuation of application No. 17/232,679, filed on Apr. 16, 2021, granted, now 11,922,954.
Application 17/232,679 is a continuation of application No. 16/781,421, filed on Feb. 4, 2020, granted, now 10,984,807, issued on Apr. 20, 2021.
Application 16/781,421 is a continuation of application No. 16/368,208, filed on Mar. 28, 2019, granted, now 10,593,339, issued on Mar. 17, 2020.
Application 16/368,208 is a continuation of application No. PCT/CN2016/100617, filed on Sep. 28, 2016.
Prior Publication US 2024/0233736 A1, Jul. 11, 2024
This patent is subject to a terminal disclaimer.
Int. Cl. G10L 19/008 (2013.01); G10L 19/00 (2013.01); G10L 19/012 (2013.01); H04S 3/00 (2006.01); G10L 19/24 (2013.01); G10L 25/78 (2013.01)

CPC G10L 19/008 (2013.01) [G10L 19/00 (2013.01); G10L 19/012 (2013.01); H04S 3/008 (2013.01); G10L 19/24 (2013.01); G10L 25/78 (2013.01); H04S 2400/03 (2013.01)]

20 Claims

1. A terminal comprising:

an encoder comprising a first algorithm and configured to:

mix N^th-frame audio signals of two of a plurality of channels of a multi-channel audio signal based on the first algorithm to obtain an N^th-frame downmixed signal, wherein N is a positive integer greater than zero;

detect, using voice activity detection (VAD), whether the N^th-frame downmixed signal comprises a speech signal; and

encode the N^th-frame downmixed signal into a bitstream when detecting that the N^th-frame downmixed signal does not comprise the speech signal and when the N^th-frame downmixed signal satisfies a preset audio frame encoding condition; and

a transmitter coupled to the encoder and configured to transmit the bitstream.
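
The encoder steps recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation: the claim does not define the "first algorithm", the VAD method, or the "preset audio frame encoding condition", so the sample-wise average downmix, the energy-threshold VAD, the every-eighth-frame condition, and the 8-bit placeholder encoder below are all assumptions chosen for illustration, and every function name is hypothetical.

```python
from typing import Optional, Sequence


def downmix(left: Sequence[float], right: Sequence[float]) -> list:
    """Assumed stand-in for the 'first algorithm': sample-wise average."""
    if len(left) != len(right):
        raise ValueError("channel frames must have equal length")
    return [(l + r) / 2.0 for l, r in zip(left, right)]


def contains_speech(frame: Sequence[float], energy_threshold: float = 0.01) -> bool:
    """Toy VAD (assumption): mean energy above a fixed threshold counts as speech."""
    if not frame:
        return False
    energy = sum(s * s for s in frame) / len(frame)
    return energy > energy_threshold


def satisfies_encoding_condition(frame_index: int, interval: int = 8) -> bool:
    """Assumed 'preset condition': only every interval-th frame qualifies."""
    return frame_index % interval == 0


def encode_frame(frame: Sequence[float]) -> bytes:
    """Placeholder encoder (assumption): clamp and quantize to one byte per sample."""
    return bytes(max(-128, min(127, round(s * 127))) & 0xFF for s in frame)


def process_frame(frame_index: int,
                  left: Sequence[float],
                  right: Sequence[float]) -> Optional[bytes]:
    """Follow the claim 1 sequence: downmix, run VAD, then conditionally encode."""
    mixed = downmix(left, right)
    if contains_speech(mixed):
        # Handling of speech frames is not recited in the quoted claim text,
        # so it is not modelled here.
        return None
    if satisfies_encoding_condition(frame_index):
        return encode_frame(mixed)
    # Non-speech frame that does not meet the condition: nothing is encoded.
    return None


if __name__ == "__main__":
    silent = [0.0] * 4
    for n in (8, 9):
        print(n, process_frame(n, silent, silent))
```

In this sketch, a returned `bytes` object stands for the bitstream handed to the transmitter, and `None` means no bitstream is produced for that frame.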