| CPC G10L 19/008 (2013.01) [G10L 19/167 (2013.01); H04S 7/00 (2013.01); H04S 2420/11 (2013.01)] | 18 Claims |

|
1. A three-dimensional audio signal encoding method performed by at least one processor, coupled with a memory configured to store a computer program, which when executed by the at least one processor, causes the at least one processor to perform the method, comprising:
obtaining a first correlation between a current frame of a three-dimensional audio signal and a representative virtual speaker set for a previous frame, wherein a virtual speaker in the representative virtual speaker set for the previous frame is used for encoding the previous frame of the three-dimensional audio signal, and the first correlation is used to determine whether to reuse the representative virtual speaker set for the previous frame when the current frame is encoded;
obtaining, after the obtaining the first correlation, a second correlation between the current frame and a candidate virtual speaker set, wherein the second correlation is used to determine whether the candidate virtual speaker set is used when the current frame is encoded, and the representative virtual speaker set for the previous frame is a proper subset of the candidate virtual speaker set;
encoding, by an encoder, the current frame based on the representative virtual speaker set for the previous frame when the first correlation satisfies a reuse condition, to obtain a bitstream, wherein the reuse condition comprises: the first correlation being greater than the second correlation; and
decoding, with a decoder communicatively coupled with the encoder, the bitstream generated by the encoder.
|
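The reuse decision recited in claim 1 can be illustrated with a minimal sketch. The claim does not define how either correlation is computed, so everything below is an assumption made for illustration: the names `speaker_correlation`, `set_correlation`, and `choose_speaker_set` are hypothetical, the per-speaker correlation is taken as the magnitude of an inner product, and the set-level correlation is taken as the mean over the set. The fallback to the candidate set when the reuse condition fails is also an assumption, since the claim only recites the reuse branch.

```python
from typing import List, Sequence, Tuple

Vector = Sequence[float]


def speaker_correlation(frame: Vector, speaker: Vector) -> float:
    # Assumed measure (not specified in the claim): magnitude of the inner
    # product between the frame coefficients and one speaker's coefficients.
    return abs(sum(f * s for f, s in zip(frame, speaker)))


def set_correlation(frame: Vector, speakers: Sequence[Vector]) -> float:
    # Assumed aggregation (not specified in the claim): mean over the set.
    return sum(speaker_correlation(frame, s) for s in speakers) / len(speakers)


def choose_speaker_set(
    frame: Vector,
    previous_set: Sequence[Vector],
    candidate_set: Sequence[Vector],
) -> Tuple[bool, List[Vector]]:
    # First correlation: current frame vs. the previous frame's
    # representative virtual speaker set.
    first = set_correlation(frame, previous_set)
    # Second correlation, obtained after the first: current frame vs. the
    # candidate set, of which the previous set is a proper subset.
    second = set_correlation(frame, candidate_set)
    # Reuse condition from the claim: first correlation > second correlation.
    reuse = first > second
    # Assumed fallback: use the candidate set when the condition fails.
    return reuse, list(previous_set if reuse else candidate_set)


candidates = [[1.0, 0.0], [0.0, 1.0]]
previous = [candidates[0]]  # proper subset of the candidate set

print(choose_speaker_set([1.0, 0.0], previous, candidates)[0])  # True
print(choose_speaker_set([0.0, 1.0], previous, candidates)[0])  # False
```

In the first call the current frame still aligns with the previous frame's speaker, so the first correlation (1.0) exceeds the second (0.5) and the previous set is reused. In the second call the frame has moved away from that speaker, so the condition fails.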