CPC G06N 20/00 (2019.01) | 20 Claims |
1. A method comprising, at each of multiple time steps:
generating, based on quality measures of data augmentation policies generated at one or more previous time steps, a current data augmentation policy that defines a procedure for transforming training inputs for a neural network, wherein:
for each previous time step, the quality measure of the data augmentation policy generated at the previous time step characterizes a performance measure of the neural network after the neural network has been trained using the data augmentation policy generated at the previous time step; and
the current data augmentation policy defines the procedure for transforming training inputs by defining, for each transformation operation in a sequence of transformation operations for the current data augmentation policy:
(i) a respective probability distribution over a space of transformation types for the transformation operation, and
(ii) a respective probability distribution over a space of transformation magnitudes for the transformation operation;
training the neural network using the current data augmentation policy, comprising:
selecting a batch of training inputs;
determining an augmented batch of training inputs by transforming the training inputs in the batch of training inputs in accordance with the current data augmentation policy, comprising, for each training input in the batch of training inputs:
for each transformation operation in the sequence of transformation operations for the current data augmentation policy:
selecting a transformation type for the transformation operation in accordance with the probability distribution over the space of transformation types for the transformation operation;
selecting a transformation magnitude for the transformation operation in accordance with the probability distribution over the space of transformation magnitudes for the transformation operation;
transforming the training input by applying the transformation operation with the selected transformation type and transformation magnitude; and
adjusting current values of parameters of the neural network by training the neural network on the augmented batch of training inputs that have been transformed in accordance with the current data augmentation policy by a machine learning training technique to optimize an objective function; and
determining a quality measure of the current data augmentation policy using the neural network after the neural network has been trained using the current data augmentation policy.
|