| CPC G06T 1/20 (2013.01) [G06F 16/1724 (2019.01); G06N 20/00 (2019.01); G06T 1/60 (2013.01)] | 14 Claims |

|
1. A graphics multiprocessor, comprising:
a plurality of compute engines to perform first computations to generate a first set of data of a process;
cache for storing data; and
a high density memory for temporal buffering that is integrated on a same semiconductor chip with the plurality of compute engines and the cache, the high density memory to receive the first set of data, to temporarily store the first set of data, and to provide the first set of data from the high density memory to the cache during a first time period that is prior to a second time period when the plurality of compute engines request the first set of data for second computations of the process, wherein the plurality of compute engines use the first set of data for second computations to generate a second set of data, wherein the first set of data comprises activation data for a forward pass of the process that will be transferred from the high density memory to the cache prior to when the activation data is needed during a backward pass of the process.
|
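
By way of illustration only, the data flow recited in claim 1 can be modeled in software as a minimal sketch. The names `TemporalBuffer`, `Cache`, and `run`, the LRU replacement policy, the cache capacity, and the toy arithmetic are all assumptions introduced for this example and do not appear in the claim. The sketch writes activation data into a stand-in for the high density memory during a forward pass, then moves each activation into the cache one step before the backward pass requests it, so that every request is served from the cache.

```python
from collections import OrderedDict


class TemporalBuffer:
    """Hypothetical stand-in for the on-chip high density memory."""

    def __init__(self):
        self._store = {}

    def write(self, key, value):
        self._store[key] = value

    def read(self, key):
        return self._store[key]


class Cache:
    """Small LRU cache; the policy and capacity are illustrative assumptions."""

    def __init__(self, capacity):
        self.capacity = capacity
        self._lines = OrderedDict()
        self.hits = 0
        self.misses = 0

    def fill(self, key, value):
        # Insert a line, evicting the least recently used one if full.
        self._lines[key] = value
        self._lines.move_to_end(key)
        while len(self._lines) > self.capacity:
            self._lines.popitem(last=False)

    def lookup(self, key):
        if key in self._lines:
            self.hits += 1
            self._lines.move_to_end(key)
            return self._lines[key]
        self.misses += 1
        return None


def run(num_layers=4, cache_capacity=2):
    buffer = TemporalBuffer()
    cache = Cache(cache_capacity)

    # Forward pass (first computations): each layer yields activation data,
    # which is temporarily stored in the buffer.
    x = 1.0
    for layer in range(num_layers):
        x = x * 2.0 + layer  # toy computation, not a real layer
        buffer.write(layer, x)

    # First time period: move the first-needed activation into the cache
    # before the backward pass starts.
    cache.fill(num_layers - 1, buffer.read(num_layers - 1))

    # Backward pass (second computations), visiting layers in reverse.
    grads = []
    for layer in reversed(range(num_layers)):
        # Second time period: the compute engines request the activation.
        act = cache.lookup(layer)
        if act is None:  # fallback path; not expected when prefetch works
            act = buffer.read(layer)
        # Prefetch the next activation while this layer is being processed.
        if layer > 0:
            cache.fill(layer - 1, buffer.read(layer - 1))
        grads.append((layer, 2.0 * act))  # toy "gradient"
    return cache.hits, cache.misses, grads
```

Calling `run()` with the defaults reports four cache hits and no misses, which is the behavior the prefetch ordering is meant to show. In hardware, the transfer would be driven by a memory controller or scheduler rather than by explicit calls, and this sketch makes no claim about how that is implemented.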