| CPC G06N 3/08 (2013.01) [G06N 3/045 (2023.01)] | 20 Claims |

|
1. A neural network training method, performed by a computer device, the method comprising:
obtaining a training sample set, each training sample in the training sample set including a corresponding standard label;
training a neural network module based on inputting the each training sample in the training sample set into the neural network model, the neural network model comprising n attention networks, the n attention networks respectively mapping the each training sample to n different subspaces, each subspace of the n subspaces comprising a corresponding query vector sequence, a corresponding key vector sequence, and a corresponding value vector sequence, and n being an integer greater than 1;
determining a space difference degree between the n subspaces by using the neural network model;
determining an output similarity degree according to an output of the neural network model and the standard label corresponding to the each training sample; and
retraining the neural network model by adjusting a model parameter of the neural network model according to the space difference degree and the output similarity degree until a convergence condition is satisfied, thereby obtaining a target neural network model based on retraining the neural network model by adjusting the model parameter.
|