CPC G06N 3/063 (2013.01) [G06F 17/16 (2013.01); G06N 3/04 (2013.01); G06N 3/045 (2023.01); G06N 3/084 (2013.01)] | 20 Claims |
1. A system comprising:
a set of processing units; and
a non-transitory machine-readable medium storing instructions that when executed by at least one processing unit in the set of processing units cause the at least one processing unit to:
receive, at a layer included in a neural network, a first matrix;
compress the first matrix to produce a second matrix to reduce an amount of hardware resources utilized to process the second matrix, the second matrix having a reduced dimensionality relative to a dimensionality of the first matrix;
process the second matrix through a network block in the layer included in the neural network;
expand the processed second matrix to produce a third matrix, the third matrix having a dimensionality that is equal to a dimensionality of the first matrix; and
add the third matrix to the first matrix to produce a fourth matrix.
|