| CPC G06N 3/048 (2023.01) [G06F 7/49957 (2013.01); G06F 7/556 (2013.01); G06F 17/11 (2013.01); G06F 17/17 (2013.01); G06N 3/045 (2023.01); G06N 3/063 (2013.01); G06N 3/084 (2013.01); G06N 3/044 (2023.01)] | 20 Claims |

|
1. A method for implementing a deep neural network, the method comprising:
receiving an input vector to be processed in the deep neural network, the input vector comprising one or more floating-point input elements, the deep neural network comprising one or more hidden layers;
converting the one or more floating-point input elements into one or more fixed-point input elements;
inputting the one or more fixed-point input elements into the one or more hidden layers;
computing, by the one or more hidden layers, one or more fixed-point output elements by:
converting one or more floating-point weights of the one or more hidden layers into one or more fixed-point weights, and
performing, in the one or more hidden layers, a convolution using the one or more fixed-point weights and the one or more fixed-point input elements;
converting the one or more fixed-point output elements into one or more floating-point output elements; and
generating an output of the deep neural network using the one or more floating-point output elements.
|