CPC G06F 17/153 (2013.01) [G06F 7/5443 (2013.01); G06N 3/04 (2013.01); G06N 3/063 (2013.01); G06N 3/045 (2023.01)] | 20 Claims |
1. An input buffer for a convolution layer of a deep neural network, configured to:
store in a memory of the input buffer a plurality of input data values for the convolution layer;
provide the plurality of input data values to a plurality of convolution engines over a plurality of cycles wherein in any cycle each convolution engine is provided with a different set of input data values relative to the other convolution engines;
prior to providing a set of input data values of the plurality of input data values to a convolution engine for a first time, fetch that set of input data values from the memory of the input buffer; and
store the K most recent sets of input data values fetched from the memory of the input buffer in a cache, wherein K is an integer greater than or equal to two, and if a portion of a next set of input data values to be fetched is in the cache that portion of the next set of input data values is not fetched from the memory of the input buffer and is obtained from the cache.
|