US 12,462,821 B2
Low latency audio packet loss concealment
Peter Mariager, Nørresundby (DK); Jonas Koldkjaer Jensen, Nørresundby (DK); Filip Mathias Lillelund Jørgensen, Nørresundby (DK); and Ricco Jensen, Nørresundby (DK)
Assigned to RTX A/S, Norresundby (DK)
Appl. No. 18/002,081
Filed by RTX A/S, Nørresundby (DK)
PCT Filed Jun. 17, 2021, PCT No. PCT/EP2021/066362
§ 371(c)(1), (2) Date Dec. 16, 2022,
PCT Pub. No. WO2021/255153, PCT Pub. Date Dec. 23, 2021.
Claims priority of application No. PA 2020 70403 (DK), filed on Jun. 19, 2020.
Prior Publication US 2023/0230602 A1, Jul. 20, 2023
Int. Cl. G10L 19/02 (2013.01); G10L 19/005 (2013.01); G10L 25/30 (2013.01)
CPC G10L 19/02 (2013.01) [G10L 19/005 (2013.01); G10L 25/30 (2013.01)] 22 Claims
OG exemplary drawing
 
1. A method for concealing errors, the method comprising:
training a plurality of different Long Short-Term Memory (LSTM) neural network configurations or LSTM neural network data sets using respective different specific audio inputs,
providing (P_L_NN) a LSTM neural network with a plurality of nodes, wherein the LSTM neural network has been pre-trained with audio data,
receiving (R_P) a sequence of packets each comprising a set of modified discrete cosine transform (MDCT) coefficients associated with a frame comprising time-domain samples of an audio signal,
classifying audio in response to the sequence of packets, and selecting one of the different LSTM neural network configurations or LSTM neural network data sets to be used for estimated MDCT coefficient generation accordingly,
applying (A_P_L_NN) the sequence of packets to the LSTM neural network,
identifying (I_E_P) in the sequence of packets a packet to be an erroneous packet,
generating (G_CFF) estimated MDCT coefficients to replace the set of MDCT coefficients of the erroneous packet in response to an output from the LSTM neural network,
generating (G_CP) a concealment packet based on the estimated MDCT coefficients, and
replacing (R_E_P) the erroneous packet with the concealment packet.