CPC G06N 3/082 (2013.01) | 20 Claims |
1. A method, comprising:
pruning, by a processor, a layer of a neural network using a first threshold, the neural network comprising multiple layers, the pruning comprising configuring weights of the layer based on the first threshold;
determining, by the processor, a pruning error of the layer based on a percentage of the configured weights of the layer with respect to an initial number of weights of the layer; and
repeatedly configuring, by the processor, the weights of the layer until the pruning error of the layer equals a pruning error allowance for the layer, wherein the repeatedly configuring the weights of the layer allows a percentage of the number of weights remaining after the repeatedly configuring the weights of the layer with respect to the initial number of weights of the layer to be determined, and wherein each iteration of the repeatedly configuring the weights of the layer uses a different first threshold based on the pruning error.
|