| CPC G06N 3/08 (2013.01) | 12 Claims |

|
1. A computer for distributed processing, comprising:
a plurality of accelerators having a part of a neural network assigned to each thereof, each of the plurality of accelerators being configured to derive a learning result based on input data and update each parameter value included in the part of the neural network by using the learning result;
a plurality of network interface circuits, each being configured to transmit and receive information on learning including the learning result; and
an arithmetic processing unit configured to control the plurality of accelerators and the plurality of network interface circuits so as to cause each of the plurality of accelerators to derive a learning result based on input data and so as to cause the plurality of network interface circuits to transmit and receive, in parallel, information on learning including the learning result derived by each of the plurality of accelerators, wherein
the plurality of accelerators perform, in pipelines, feed forward processing and back propagation processing, for a mini batch including a plurality of pieces of input data, based on each of the plurality of pieces of input data, thereby deriving a plurality of learning results based on each of the plurality of pieces of input data; and
the plurality of network interface circuits sequentially transmit and receive information on learning with the back propagation processing overlapped each time the plurality of accelerators derive the plurality of learning results based on the plurality of pieces of input data included in the mini batch, the information on learning including the learning results.
|
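The overlap recited in claim 1, in which information on learning is transmitted each time a learning result is derived rather than after the whole mini batch, can be illustrated with a minimal sketch. This is not the claimed implementation: the names `accelerator`, `network_interface`, and `run` are hypothetical, threads stand in for accelerators and network interface circuits, a single scalar weight stands in for a part of the neural network, and the parameter update step is omitted.

```python
import queue
import threading


def accelerator(acc_id, mini_batch, out_queue):
    """Hypothetical accelerator: derives one learning result per piece of input data."""
    weight = 0.5  # toy parameter standing in for this accelerator's part of the network
    for step, x in enumerate(mini_batch):
        y = weight * x            # feed forward processing
        grad = (y - 1.0) * x      # back propagation for the loss 0.5 * (y - 1)^2
        # Hand the learning result off immediately, so transmission can
        # proceed while the next piece of input data is being processed.
        out_queue.put((acc_id, step, grad))
    out_queue.put(None)  # sentinel: the mini batch is finished


def network_interface(in_queue, sent):
    """Hypothetical network interface: 'transmits' each result as soon as it arrives."""
    while True:
        item = in_queue.get()
        if item is None:
            break
        sent.append(item)  # stand-in for an actual network send


def run(mini_batch, num_accelerators=2):
    """Run all accelerators and their interfaces in parallel; return what each interface sent."""
    sent_logs = [[] for _ in range(num_accelerators)]
    threads = []
    for i in range(num_accelerators):
        q = queue.Queue()
        threads.append(threading.Thread(target=accelerator, args=(i, mini_batch, q)))
        threads.append(threading.Thread(target=network_interface, args=(q, sent_logs[i])))
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return sent_logs


if __name__ == "__main__":
    for log in run([1.0, 2.0, 3.0]):
        print(log)
```

Each interface thread drains its queue while the paired accelerator thread is still working through the mini batch, which is the sense in which communication and back propagation overlap in this sketch. One queue per accelerator keeps the results of each accelerator in the order in which they were derived.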