US 12,437,767 B2
Multi-channel audio signal encoding and decoding method and apparatus
Zhi Wang, Beijing (CN); Jiance Ding, Beijing (CN); Bingyin Xia, Beijing (CN); Bin Wang, Shenzhen (CN); and Zhe Wang, Beijing (CN)
Assigned to HUAWEI TECHNOLOGIES CO., LTD., Shenzhen (CN)
Filed by Huawei Technologies Co., Ltd., Guangdong (CN)
Filed on Jan. 11, 2023, as Appl. No. 18/153,128.
Application 18/153,128 is a continuation of application No. PCT/CN2021/106101, filed on Jul. 13, 2021.
Claims priority of application No. 202010699706.7 (CN), filed on Jul. 17, 2020.
Prior Publication US 2023/0154471 A1, May 18, 2023
Int. Cl. G10L 19/008 (2013.01); G10L 25/06 (2013.01)
CPC G10L 19/008 (2013.01) [G10L 25/06 (2013.01)]
20 Claims
 
1. A multi-channel audio signal encoding method, comprising:
obtaining a to-be-encoded first audio frame, wherein the first audio frame comprises at least five channel signals;
obtaining a correlation value set, wherein the correlation value set comprises respective correlation values of a plurality of channel pairs, wherein one channel pair of the plurality of channel pairs comprises two channel signals of the at least five channel signals, and wherein a correlation value of the channel pair indicates correlation between the two channel signals of the channel pair;
selecting M correlation values from the correlation value set, wherein all the M correlation values are greater than correlation values other than the M correlation values in the correlation value set, wherein all the M correlation values are greater than or equal to a pairing threshold, and wherein M is a positive integer less than or equal to a specified value;
obtaining M channel pair sets, wherein each channel pair set comprises one or more channel pairs corresponding to the M correlation values, and wherein when the channel pair set comprises at least two channel pairs, the at least two channel pairs do not comprise a same channel signal;
determining a target channel pair set from the M channel pair sets, wherein a sum of correlation values of all channel pairs in the target channel pair set is the largest in those of the M channel pair sets; and
encoding the first audio frame based on the target channel pair set.
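
The pair-selection steps of claim 1 can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation. The claim does not fix how a correlation value is computed or how each of the M channel pair sets is built, so the sketch assumes an absolute normalized cross-correlation at lag 0 and builds one candidate set per selected pair, adding further selected pairs in descending correlation order when they share no channel with the set. The function names, `pairing_threshold`, and `max_m` (the "specified value") are hypothetical.

```python
from itertools import combinations
from math import sqrt


def correlation(x, y):
    """Absolute normalized cross-correlation at lag 0 (an assumed measure)."""
    num = sum(a * b for a, b in zip(x, y))
    den = sqrt(sum(a * a for a in x) * sum(b * b for b in y))
    return abs(num) / den if den > 0 else 0.0


def correlation_set(frame):
    """Map each unordered channel pair (i, j) of a frame to its correlation."""
    if len(frame) < 5:
        raise ValueError("claim 1 assumes at least five channel signals")
    return {(i, j): correlation(frame[i], frame[j])
            for i, j in combinations(range(len(frame)), 2)}


def choose_target_set(corr, pairing_threshold, max_m):
    """Return (target_pair_set, correlation_sum) for one frame."""
    # Keep the largest values, at most max_m of them, that reach the threshold.
    ranked = sorted(corr, key=corr.get, reverse=True)
    selected = [p for p in ranked[:max_m] if corr[p] >= pairing_threshold]

    # Assumed construction: each selected pair seeds one candidate set, which
    # then takes every other selected pair that shares no channel with it.
    candidate_sets = []
    for seed in selected:
        pair_set, used = [seed], set(seed)
        for p in selected:
            if used.isdisjoint(p):
                pair_set.append(p)
                used.update(p)
        candidate_sets.append(pair_set)

    if not candidate_sets:
        return [], 0.0  # no pair reached the threshold

    # Target set: the candidate with the largest sum of correlation values.
    target = max(candidate_sets, key=lambda s: sum(corr[p] for p in s))
    return target, sum(corr[p] for p in target)


# Hand-made correlation values for five channels (illustrative only).
example_corr = {
    (0, 1): 0.8, (0, 2): 0.1, (0, 3): 0.1, (0, 4): 0.1, (1, 2): 0.9,
    (1, 3): 0.1, (1, 4): 0.1, (2, 3): 0.8, (2, 4): 0.1, (3, 4): 0.1,
}
target, total = choose_target_set(example_corr, pairing_threshold=0.3, max_m=3)
print(sorted(target), total)
```

In this example the single most correlated pair, (1, 2), shares a channel with both other selected pairs, so its candidate set contains only itself. The set made up of (0, 1) and (2, 3) has a larger correlation sum and is chosen as the target, which shows why the claim compares several candidate sets instead of pairing greedily from the top value alone.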