CPC G06N 3/084 (2013.01) [G06F 13/4282 (2013.01); G06N 3/04 (2013.01)] | 20 Claims |
1. A graphics processing unit, comprising:
a graphics processing unit cache memory,
wherein the graphics processing unit is communicatively coupled to a central processing unit comprising a central processing unit cache memory,
wherein the graphics processing unit, during a forward pass process of training a deep neural network that traverses through a set of layers of the deep neural network from a first layer of the set of layers to a last layer of the set of layers, transmits, to the central processing unit for storage in the central processing unit cache memory, data from the graphics processing unit cache memory employed for the training by an intermediate layer of the set of layers between the first layer and the last layer, and
wherein the graphics processing unit has determined that at least a portion of the data will be employed by the intermediate layer during a backward pass process of training the deep neural network that traverses from the last layer to the first layer.
|