CPC G06F 13/4027 (2013.01) [G06N 3/04 (2013.01)] | 19 Claims |
1. A configurable stacked architecture for a fixed function datapath for use with an accelerator to accelerate an operation of a deep neural network (DNN), comprising:
a plurality of configurable micro-scalar processing units (SPUs) that perform at least one scalar operation on vector values from a received vector; and
a plurality of configurable micro-multi-functional units (MFUs) that perform vector operations on the vector values, wherein the plurality of configurable micro-SPUs and the plurality of configurable micro-MFUs are placed in an order to perform the operation of the DNN, wherein the order includes:
an output of a first micro-SPU of the plurality of configurable micro-SPUs being provided as an input to a first micro-MFU of the plurality of configurable micro-MFUs, and
an output of the first micro-MFU being provided as input to a second micro-SPU of the plurality of configurable micro-SPUs.
|