US 12,008,475 B2
Transposed sparse matrix multiply by dense matrix for neural network training
Hao Wu, Santa Clara, CA (US)
Assigned to NVIDIA Corporation, Santa Clara, CA (US)
Filed by NVIDIA Corporation, Santa Clara, CA (US)
Filed on Nov. 14, 2018, as Appl. No. 16/191,201.
Prior Publication US 2020/0151571 A1, May 14, 2020
Int. Cl. G06N 3/084 (2023.01)
CPC G06N 3/084 (2013.01) 32 Claims
OG exemplary drawing
 
1. A computer-implemented method, comprising:
causing one or more multiply operations to be performed on elements of a sparse matrix and a dense matrix based, at least in part, on a sparse matrix index map identifying the elements, on which the one or more multiply operations are to be performed; and
causing results of the one or more multiply operations to be accumulated in one or more corresponding storage locations according to the sparse matrix index map.