CPC G06N 3/082 (2013.01) [G06F 18/2148 (2023.01); G06F 18/217 (2023.01); G06N 3/047 (2023.01); G06N 3/08 (2013.01)] | 20 Claims |
1. A system comprising:
at least one processor; and
memory storing instructions that, when executed by the at least one processor, cause the system to perform a set of operations, the set of operations comprising:
iteratively tuning a probability distribution associated with a neural network architecture parameter for generating a neural network, the iteratively tuning comprising:
generating a sampled neural network architecture using the probability distribution;
evaluating training data from a training data store using the sampled neural network architecture to compute a gradient of a loss function associated with the sampled neural network architecture;
updating the probability distribution for the neural network architecture parameter based on the computed gradient of the loss function, thereby generating an updated iteration of the probability distribution for a subsequent iteration of tuning the probability distribution; and
evaluating the probability distribution based on termination criteria to determine whether the termination criteria are satisfied; and
when it is determined that the termination criteria are satisfied, generating a result neural network architecture having a value for the parameter based on the iteratively tuned probability distribution.
|
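The iterative tuning recited in claim 1 can be illustrated with a minimal sketch. It is a toy reading of the claim language under stated assumptions, not the patented implementation. The architecture parameter is assumed to be a single categorical choice among three hypothetical feature transforms (`identity`, `square`, `cube`), and the probability distribution is assumed to be a softmax over logits. The update is a score-function (REINFORCE-style) estimate of the gradient of the expected loss, swapped in here as one plausible update rule. The termination criterion is assumed to be a probability threshold. All names (`CANDIDATES`, `search`, `loss_and_grad`) and hyperparameters are illustrative.

```python
import math
import random

# Hypothetical candidate values for a single architecture parameter: the
# feature transform applied before one trainable weight (model: y = w * f(x)).
CANDIDATES = {
    "identity": lambda x: x,
    "square": lambda x: x * x,
    "cube": lambda x: x ** 3,
}


def softmax(logits):
    """Turn unnormalized logits into a categorical probability distribution."""
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def loss_and_grad(weight, transform, data):
    """Mean squared error of y = weight * transform(x), and its gradient w.r.t. weight."""
    loss, grad = 0.0, 0.0
    for x, y in data:
        feature = transform(x)
        err = weight * feature - y
        loss += err * err / len(data)
        grad += 2.0 * err * feature / len(data)
    return loss, grad


def search(data, max_iters=2000, threshold=0.95,
           lr_weight=0.05, lr_logit=0.05, seed=0):
    rng = random.Random(seed)
    names = list(CANDIDATES)
    logits = [0.0] * len(names)   # parameters of the distribution being tuned
    weights = [0.0] * len(names)  # one trainable weight per candidate (assumption)
    baseline = None               # running mean loss, used to reduce variance

    for _ in range(max_iters):
        probs = softmax(logits)
        # Generate a sampled architecture using the current distribution.
        k = rng.choices(range(len(names)), weights=probs)[0]
        # Evaluate the training data with the sampled architecture.
        loss, grad = loss_and_grad(weights[k], CANDIDATES[names[k]], data)
        weights[k] -= lr_weight * grad
        # Score-function estimate of d E[loss] / d logits (assumed update rule).
        if baseline is None:
            baseline = loss
        advantage = loss - baseline
        for j in range(len(names)):
            indicator = 1.0 if j == k else 0.0
            logits[j] -= lr_logit * advantage * (indicator - probs[j])
        baseline = 0.9 * baseline + 0.1 * loss
        # Termination criterion (assumption): one value holds enough probability mass.
        if max(softmax(logits)) >= threshold:
            break

    # Result architecture: the most probable value under the tuned distribution.
    probs = softmax(logits)
    best = max(range(len(names)), key=lambda j: probs[j])
    return names[best], dict(zip(names, probs))


# Toy training data drawn from y = x^2, so "square" should be favored.
data = [(i * 0.5, (i * 0.5) ** 2) for i in range(-4, 5)]
best, probs = search(data)
print(best, probs)
```

In this sketch the result architecture takes the most probable value of the parameter once the threshold is met. Other readings of "based on the iteratively tuned probability distribution", such as drawing a final sample, would fit the claim language equally well.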