CPC G06N 3/08 (2013.01) [G06N 3/04 (2013.01); G06N 3/063 (2013.01); G06N 3/084 (2013.01)] | 17 Claims |
1. A method of optimizing a neural network having a plurality of layers, the method comprising:
obtaining architecture constraints for circuitry of an inference platform that implements the neural network, wherein the architecture constraints comprise a first architecture constraint based on structure of the circuitry of the inference platform and a related data parameter constraint;
training the neural network on a training platform using the architecture constraints to generate network parameters and feature maps for the plurality of layers; and
constraining, during the training of the neural network, the network parameters, the feature maps, or both the network parameters and the feature maps, based on the architecture constraints.
|