| CPC G10L 19/06 (2013.01) [G06N 3/045 (2023.01); G06N 3/088 (2013.01); G10L 25/30 (2013.01)] | 17 Claims |

|
1. A method of training an audio separation network, the method comprising:
obtaining a first separation sample set, the first separation sample set comprising at least two types of audio signals with dummy labels;
obtaining a first sample set by performing interpolation on the first separation sample set based on perturbation data;
obtaining a second separation sample set by separating the first sample set using an unsupervised network;
determining losses of second separation samples in the second separation sample set; and
adjusting network parameters of the unsupervised network based on the losses of the second separation samples, such that a first loss of a first separation result outputted by an adjusted unsupervised network meets a convergence condition,
wherein the determining the losses of the second separation samples in the second separation sample set comprises obtaining a loss set by:
determining a loss between each second separation sample and true value data of the first separation sample set; and
obtaining a loss of each second separation sample, and
wherein the adjusting the network parameters of the unsupervised network based on the losses of the second separation samples comprises obtaining updated network parameters by:
determining a minimum loss from the loss set; and
updating the network parameters of the unsupervised network based on the minimum loss.
|