CPC G06F 12/0207 (2013.01) [G06F 7/523 (2013.01); G06N 3/063 (2013.01); G06N 20/00 (2019.01); G06F 2212/2024 (2013.01)] | 17 Claims |
1. A device comprising:
a tile array comprising a plurality of tiles, wherein the device is configured to perform operations comprising:
receiving a plurality of input activations for a first layer of a model; and
performing, at each time step of a plurality of time steps, operations on tile columns within each tile wide column of a plurality of tile wide columns of the tile array, wherein each tile wide column of the tile array comprises multiple tile columns, the operations comprising:
distributing different feature values of a respective single input activation location along the tile column of a respective tile wide column,
performing respective compute operations using the different feature values for tiles in the respective tile column for the time step,
computing a respective output result for each respective tile column for the time step including computing a sum of results of the compute operations for the tile column, and
storing the respective output result for the tile column in a particular output RAM from which the output result will be read by a subsequent layer of the model.
|
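The per-time-step operations recited in claim 1 can be illustrated with a minimal software sketch. It is not the claimed hardware and makes several assumptions not stated in the claim: the compute operation is taken to be a multiply against a per-tile weight, the output RAM is modeled as a dictionary, and the rule choosing which input activation location feeds which tile column is a hypothetical round-robin schedule. The names `run_time_step`, `weights`, and `output_ram` are illustrative only.

```python
from typing import Dict, List, Tuple

# Key of the modeled output RAM: (time step, tile wide column, tile column).
RamKey = Tuple[int, int, int]


def run_time_step(
    activations: List[List[float]],    # activations[location][feature]
    weights: List[List[List[float]]],  # weights[wide_column][tile_column][tile_row]
    step: int,
    output_ram: Dict[RamKey, float],
) -> None:
    """Run one time step over every tile column of every tile wide column."""
    for w, wide_column in enumerate(weights):
        for c, column_weights in enumerate(wide_column):
            # Hypothetical schedule (an assumption): pick the single input
            # activation location that feeds this tile column at this step.
            location = (step + w * len(wide_column) + c) % len(activations)
            features = activations[location]
            if len(features) != len(column_weights):
                raise ValueError("need one feature value per tile in the column")

            # Distribute the different feature values along the tile column:
            # the tile in row r receives feature r and performs its compute
            # operation (assumed here to be a multiply by that tile's weight).
            products = [f * wt for f, wt in zip(features, column_weights)]

            # Sum the per-tile results for the column and store the output
            # result where a subsequent layer would read it.
            output_ram[(step, w, c)] = sum(products)


if __name__ == "__main__":
    # Two input locations with two feature values each (made-up numbers).
    acts = [[1.0, 2.0], [3.0, 4.0]]
    # One tile wide column made of two tile columns, each two tiles tall.
    wts = [[[1.0, 1.0], [0.5, 2.0]]]
    ram: Dict[RamKey, float] = {}
    for t in range(2):
        run_time_step(acts, wts, t, ram)
    for key in sorted(ram):
        print(key, ram[key])
```

Keying the modeled RAM by tile column mirrors the claim's "particular output RAM" per column result; a real device would presumably use fixed memory banks rather than a dictionary.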