| CPC G06F 9/4881 (2013.01) [G06F 7/24 (2013.01); G06F 9/5027 (2013.01); G06F 9/544 (2013.01)] | 20 Claims |

|
1. A computing device comprising:
a hardware accelerator including a plurality of processing areas and a plurality of memory buffers, wherein:
the plurality of processing areas are arranged in a plurality of processing area groups that each include three or more of the processing areas arranged in a fully connected graph;
each processing area group of the plurality of processing area groups is connected to two or more other processing area groups by a first interconnect ring and a second interconnect ring; and
the hardware accelerator is configured to multi-cast each of a plurality of entries of a first input matrix and a second input matrix to the plurality of processing area groups by transmitting respective identical copies of the entries of the first input matrix and the second input matrix from an initiator processing area or an initiator memory buffer to each of the processing area groups.
|
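The topology and multi-cast recited in claim 1 can be pictured with a small software model. The following is a minimal sketch for illustration only and is not taken from the patent. The names `build_topology`, `neighbours`, and `multicast` are hypothetical, as are the group and area counts. The choice to model the two interconnect rings as running through all groups in opposite directions is likewise an assumption, since the claim does not specify how the rings are routed.

```python
from itertools import combinations


def build_topology(num_groups=4, areas_per_group=3):
    """Build a hypothetical layout: fully connected groups joined by two rings."""
    if areas_per_group < 3:
        raise ValueError("each group needs three or more processing areas")
    if num_groups < 3:
        raise ValueError("need at least three groups so each has two distinct neighbours")

    # Each processing area is identified as (group index, area index).
    groups = {g: [(g, a) for a in range(areas_per_group)] for g in range(num_groups)}

    # Fully connected graph inside a group: one link per unordered pair of areas.
    intra_links = {g: set(combinations(areas, 2)) for g, areas in groups.items()}

    # Assumption: the first ring runs g -> g+1 and the second ring runs g -> g-1.
    first_ring = [(g, (g + 1) % num_groups) for g in range(num_groups)]
    second_ring = [(g, (g - 1) % num_groups) for g in range(num_groups)]
    return groups, intra_links, first_ring, second_ring


def neighbours(group, first_ring, second_ring):
    """Return the set of other groups reachable from `group` in one ring hop."""
    return {dst for src, dst in first_ring + second_ring if src == group}


def multicast(matrix_a, matrix_b, num_groups):
    """Give every group its own identical copy of each entry of both input matrices."""
    entries = [("A", i, j, v) for i, row in enumerate(matrix_a) for j, v in enumerate(row)]
    entries += [("B", i, j, v) for i, row in enumerate(matrix_b) for j, v in enumerate(row)]
    # list(entries) creates a separate copy per group, mirroring "respective identical copies".
    return {g: list(entries) for g in range(num_groups)}
```

In this model, calling `build_topology(4, 3)` yields three links inside each group (every pair of the three areas), and `neighbours` reports two adjacent groups for each group. `multicast` stands in for the initiator processing area or initiator memory buffer, and it says nothing about how the hardware actually routes the copies.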