| CPC G06N 3/063 (2013.01) [G06F 9/5027 (2013.01); G06F 9/5061 (2013.01); G06F 15/80 (2013.01)] | 20 Claims |

|
10. A method for processing a tensor by a neural network accelerator, the method comprising:
obtaining a first output working set generated by a first operation, wherein the first output working set is a set of processed partitioned tensors;
copying the first output working set to a second input working space, wherein the second input working space is a memory portion of the neural network accelerator, corresponding to a second operation;
executing the second operation on the first output working set stored in the second input working space to generate a second output working set; and
copying the second output working set to a second output working space for retrieving by a subsequent operation.
|