CPC G06N 3/06 (2013.01) [G06N 3/04 (2013.01); G06N 3/063 (2013.01)] | 19 Claims |
1. A system for dynamic sparse execution of a neural network, comprising:
at least one global buffer configured to receive inputs for the neural network;
a plurality of processing elements configured to execute activation functions for nodes of the neural network; and
at least one processor configured to:
execute ternary random projection to reduce at least one dimension of the inputs from the at least one global buffer and generate a corresponding predictable output neuron map for use by the plurality of processing elements, wherein the at least one dimension is reduced according to a tunable degree for indicating a degree at which missing values are to be predicted rather than calculated;
iteratively receive current outputs of a current layer from the plurality of processing elements as inputs of a next layer, wherein at least of one of the current outputs is expanded by setting values corresponding to one or more predictable output neurons;
reduce at least one dimension of the current outputs, and update the corresponding predictable output neuron map, based on the reduced current outputs, for use by the plurality of processing elements in generating next outputs until the plurality of processing elements have executed each layer of the neural network.
|