CPC G06N 3/08 (2013.01) [G06F 17/18 (2013.01)] | 20 Claims |
1. A system comprising:
one or more artificial intelligence (AI) processors;
one or more control processors; and
a non-transitory computer readable storage medium having stored thereon program code executable by the one or more control processors, the program code causing the one or more control processors to:
receive an artificial neural network model and configure the model, based on a plurality of training parameters, to execute a training process on the one or more AI processors;
monitor a plurality of statistics produced upon execution of the training process; and
adjust one or more training parameters of the plurality of training parameters, based on one or more statistics of the plurality of statistics, to maintain at least one of the plurality of statistics within a predetermined range,
wherein the plurality of training parameters includes a parameter configuring a precision associated with the artificial neural network model, and
wherein the artificial neural network model is configured with a first precision associated with a first layer of the artificial neural network model and a second precision associated with a second layer of the artificial neural network model, the first precision being higher than the second precision, and the first layer being a layer closer to an input of the artificial neural network model and the second layer being a layer closer to an output of the artificial neural network model.
|
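The monitor-and-adjust loop recited in claim 1 can be illustrated with a minimal Python sketch. It is not the claimed implementation: the names `TrainingParams`, `run_training_step`, and `adjust`, the toy "underflow fraction" statistic, the per-layer sensitivity offsets, and the range bounds are all hypothetical and chosen only for illustration. The sketch starts with a higher precision on the layer closer to the input and changes per-layer bit widths until each layer's statistic falls within a predetermined range.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class TrainingParams:
    # Bits of precision per layer; index 0 is the layer closest to the input.
    layer_bits: List[int]
    # A second training parameter, left unchanged in this sketch.
    learning_rate: float = 0.01


# Hypothetical per-layer sensitivity: the layer closer to the input is
# assumed to need more bits before its gradients stop underflowing.
LAYER_OFFSETS = [10, 6]


def run_training_step(params: TrainingParams) -> List[float]:
    """Stand-in for the training process on the AI processors.

    Returns one toy statistic per layer: a modeled fraction of gradient
    values that underflow to zero, which halves with each added bit.
    """
    return [
        min(1.0, 2.0 ** (offset - bits))
        for offset, bits in zip(LAYER_OFFSETS, params.layer_bits)
    ]


def adjust(params: TrainingParams, stats: List[float],
           low: float, high: float,
           min_bits: int = 4, max_bits: int = 32) -> TrainingParams:
    """Move each layer's precision one bit toward keeping its statistic in [low, high]."""
    new_bits = []
    for bits, stat in zip(params.layer_bits, stats):
        if stat > high and bits < max_bits:
            bits += 1  # too much underflow: raise precision
        elif stat < low and bits > min_bits:
            bits -= 1  # precision is more than needed: lower it
        new_bits.append(bits)
    return TrainingParams(new_bits, params.learning_rate)


LOW, HIGH = 0.01, 0.1  # the "predetermined range" (illustrative values)

# First layer (closer to the input) starts at a higher precision than the second.
params = TrainingParams(layer_bits=[16, 8])

for _ in range(10):
    stats = run_training_step(params)          # monitor statistics
    params = adjust(params, stats, LOW, HIGH)  # adjust training parameters

final_stats = run_training_step(params)
print(params.layer_bits, final_stats)
```

Under these assumed offsets and bounds, the loop leaves the input-side layer at its starting precision and raises the output-side layer only until its statistic enters the range, so the first precision stays higher than the second.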