CPC G06N 3/084 (2013.01) [G06F 9/30141 (2013.01); H03M 7/3059 (2013.01); H03M 7/46 (2013.01); H03M 7/702 (2013.01)] | 20 Claims |
1. A computing system comprising:
one or more hardware processors;
at least one memory coupled to the one or more hardware processors; and
one or more computer-readable storage media storing computer-executable instructions that, when executed, cause the computing system to perform operations comprising:
performing forward propagation for a first layer of a neural network using values in a first floating-point format to produce first activation values in the first floating-point format or a second floating-point format different than the first floating-point format, the first floating-point format and the second floating-point format having uniform mantissas;
converting at least one of the activation values to a third floating-point format having a non-uniform mantissa to provide compressed activation values;
storing the compressed activation values in the at least one memory; and
propagating the activation values in the first floating-point format or the second floating point format to a second layer of the neural network to produce second activation values in a fourth floating-point format, the fourth floating-point format being different than the first floating point format and being different than the second floating point format.
|