| CPC G06F 7/5443 (2013.01) [G06F 17/15 (2013.01); G06F 17/16 (2013.01)] | 17 Claims |

|
1. A computing device comprising:
one or more memories configured to store a first matrix and a second matrix; and
an array of processing elements, wherein each processing element includes:
a first register to receive corresponding columns of the first matrix from the one or more memories;
a second register to receive corresponding rows of the second matrix from the one or more memories;
a multiplication unit to compute a partial product for a vector-vector outer-product of corresponding element values of the first matrix for each corresponding column of the first matrix in the first register and the corresponding element values of the second matrix for each corresponding row of the second matrix in the second register; and
a summation unit to accumulate the partial products for the vector-vector outer-products of the corresponding element values of the first matrix for each corresponding column of the first matrix and the corresponding element values of the second matrix for each corresponding row of the second matrix;
wherein the array of processing elements is configured to generate a third matrix of the accumulated partial products of the corresponding element values of the corresponding columns of the first matrix and corresponding rows of the second matrix from the summation unit as the partial product of the vector-vector outer-products of
|
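
The computation recited in claim 1 can be read as matrix multiplication expressed as a sum of outer products: for each index t, column t of the first matrix and row t of the second matrix form an outer product, and the summation unit accumulates these into the third matrix. The following is a minimal software sketch of that reading, not the claimed hardware. The function name `outer_product_matmul` and the variables `col` and `row` (standing in for the first and second registers) are illustrative assumptions, and the inputs are assumed to be non-empty rectangular lists of lists.

```python
def outer_product_matmul(a, b):
    """Return the product of a and b, accumulated as a sum of outer products.

    a is an m x k matrix and b is a k x n matrix, both given as non-empty
    rectangular lists of lists (an assumption made for this sketch).
    """
    m, k = len(a), len(a[0])
    if len(b) != k:
        raise ValueError("columns of a must match rows of b")
    n = len(b[0])

    # Accumulator for the third matrix (the role of the summation units).
    c = [[0] * n for _ in range(m)]

    for t in range(k):
        col = [a[i][t] for i in range(m)]  # column t of the first matrix
        row = b[t]                         # row t of the second matrix
        # Outer product of col and row: one partial product per (i, j),
        # added to the running total for that position.
        for i in range(m):
            for j in range(n):
                c[i][j] += col[i] * row[j]
    return c
```

In this sketch each (i, j) position plays the part of one processing element: it multiplies one element of the current column by one element of the current row and adds the result to its own running sum, so the full product is available after k steps.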