US 12,444,427 B2
	Audio encoding method, audio decoding method, apparatus, computer device, storage medium, and computer program product
Junbin Liang, Shenzhen (CN)
Assigned to TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED, Shenzhen (CN)
Filed by Tencent Technology (Shenzhen) Company Limited, Shenzhen (CN)
Filed on Nov. 1, 2022, as Appl. No. 17/978,905.
Application 17/978,905 is a continuation of application No. PCT/CN2022/081414, filed on Mar. 17, 2022.
Claims priority of application No. 202110380547.9 (CN), filed on Apr. 9, 2021.
Prior Publication US 2023/0046509 A1, Feb. 16, 2023
Int. Cl. G10L 19/24 (2013.01); G06N 3/044 (2023.01); G06N 3/08 (2023.01); G10L 19/16 (2013.01); G10L 25/60 (2013.01); G10L 25/69 (2013.01)

CPC G10L 19/24 (2013.01) [G06N 3/044 (2023.01); G06N 3/08 (2013.01); G10L 19/167 (2013.01); G10L 25/60 (2013.01); G10L 25/69 (2013.01)]

17 Claims

1. A method for training an encoding bit rate prediction model, performed by a computer device, the method comprising:

obtaining a sample audio feature parameter corresponding to each of sample audio frames in a first sample audio, further including obtaining an (i−1)^thsample encoding bit rate corresponding to an (i−1)^thsample audio frame;

performing encoding bit rate prediction on the sample audio feature parameter through an encoding bit rate prediction model, to obtain a sample encoding bit rate for each of the sample audio frames, further including:

performing encoding bit rate prediction on an i^thsample audio feature parameter and the (i−1)^thsample encoding bit rate through the encoding bit rate prediction model, to obtain an i^thsample encoding bit rate corresponding to an i^thsample audio frame, wherein i is an increasing integer and a value range thereof is 1<i≤N, N is a quantity of the sample audio frames, and N is an integer larger than 1;

performing audio encoding on the sample audio frames based on the corresponding sample encoding bit rates to generate sample audio data corresponding to the sample audio frames;

performing audio decoding on the sample audio data, to obtain a second sample audio corresponding to the sample audio data; and

training the encoding bit rate prediction model based on the first sample audio and the second sample audio until a sample encoding quality score reaches a target encoding quality score,

the sample encoding quality score being determined through the first sample audio and the second sample audio.