CPC H03M 7/70 (2013.01) [G06F 18/211 (2023.01); G06F 18/2155 (2023.01); G06N 3/084 (2013.01); G06N 5/046 (2013.01); G06V 10/764 (2022.01); G06V 10/82 (2022.01); H03M 7/60 (2013.01)] | 20 Claims |
8. A computer system for compressing a deep neural network model, the computer system comprising:
one or more computer-readable non-transitory storage media configured to store computer program code; and
one or more computer processors configured to access said computer program code and operate as instructed by said computer program code, said computer program code including:
quantizing and entropy-coding code configured to cause the one or more computer processors to quantize and entropy-code weight coefficients associated with the deep neural network;
smoothing code configured to cause the one or more computer processors to locally smooth the quantized and entropy-coded weight coefficients; and
compressing code configured to cause the one or more computer processors to compress the smoothed weight coefficients based on applying a variational dropout to the weight coefficients.
|