CPC G06T 7/33 (2017.01) [G06N 3/04 (2013.01); G06N 3/063 (2013.01); G06N 3/08 (2013.01); G06T 1/20 (2013.01); G06T 2207/20081 (2013.01); G06T 2207/20084 (2013.01); G06T 2210/52 (2013.01)] | 17 Claims |
1. A neural network apparatus, comprising:
one or more processors comprising:
a controller configured to determine an operand pair including a shared operand to be shared in parallelized operations as being either one of a pixel value among pixel values of an input feature map and a weight value among weight values of a kernel, based on one of a shape of the input feature map and the weight value of the kernel being less than a predetermined value; and
processing units configured to perform the parallelized operations based on the operand pair including the determined shared operand and a respective operand from the input feature map or the kernel,
wherein multiple of the processing units are configured to perform the parallelized operations between the pixel value of the input feature map and respective weight values of the kernel as the respective operand, in parallel, in response to the controller determining the shared operand to be the pixel value of the input feature map, and
wherein plural of the processing units are configured to perform the parallelized operations between the weight value of the kernel and respective pixel values of the input feature map as the respective operand, in parallel, in response to the controller determining the shared operand to be the weight value of the kernel.
|
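The two sharing modes recited in claim 1 can be illustrated with a minimal sketch. It is not the claimed implementation. The function names, the area-based selection rule, and the `area_threshold` parameter are hypothetical, chosen only to give "a shape of the input feature map ... being less than a predetermined value" a concrete form. The list comprehensions stand in for processing units that would run in parallel in hardware.

```python
from typing import List


def choose_shared_operand(ifm_height: int, ifm_width: int, area_threshold: int) -> str:
    """Hypothetical controller rule (an assumption, not taken from the claim):
    share the pixel value when the feature map's spatial area is below the
    threshold, otherwise share the weight value."""
    return "pixel" if ifm_height * ifm_width < area_threshold else "weight"


def parallel_products(pixels: List[float], weights: List[float], shared_kind: str) -> List[float]:
    """Form one product per simulated processing unit.

    Each list element corresponds to one processing unit; a real device
    would compute these products concurrently rather than in a loop.
    """
    if shared_kind == "pixel":
        # One pixel value is shared; each unit receives a different weight value.
        shared = pixels[0]
        return [shared * w for w in weights]
    if shared_kind == "weight":
        # One weight value is shared; each unit receives a different pixel value.
        shared = weights[0]
        return [shared * p for p in pixels]
    raise ValueError(f"unknown shared operand kind: {shared_kind!r}")


if __name__ == "__main__":
    kind = choose_shared_operand(ifm_height=2, ifm_width=2, area_threshold=16)
    print(kind, parallel_products([3.0, 5.0], [1.0, 2.0, 4.0], kind))
```

In the pixel-shared mode the number of products equals the number of weight values supplied, and in the weight-shared mode it equals the number of pixel values, which mirrors the "respective operand" language of the two wherein clauses.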