CPC G06N 3/04 (2013.01) [G06F 7/5443 (2013.01); G06F 17/15 (2013.01); G06F 17/153 (2013.01); G06N 3/045 (2023.01); G06N 3/08 (2013.01); G06N 3/082 (2013.01); G06N 3/084 (2013.01); G06N 3/10 (2013.01)] | 20 Claims |
1. A method comprising:
performing, using an electronic device, a training of a convolutional neural network (CNN) to generate coefficients associated with a layer of the CNN;
tuning, using the electronic device, the layer of the CNN by at least:
selecting some of the coefficients based on a sparsity target representative of a ratio of a number of zero coefficients to a total number of the coefficients of the layer; and
setting the selected coefficients to zero; and
for each respective nonzero coefficient of the coefficients, performing, using the electronic device, a block multiply accumulation (BMA) operation, wherein the BMA operation multiplies the respective nonzero coefficient with an input data block corresponding to the respective nonzero coefficient.
|