CPC G06F 12/0875 (2013.01) [G06F 9/30043 (2013.01); G06F 9/30047 (2013.01); G06F 9/383 (2013.01); G06F 15/8023 (2013.01); G06N 3/04 (2013.01); G06F 2212/1016 (2013.01)] | 20 Claims |
1. A neural processing device comprising:
a processing unit configured to receive an input activation and a weight and perform a two-dimensional matrix calculation with the input activation and the weight to generate an output activation;
a first memory; and
a load-store unit (LSU) configured to perform memory access operations between the first memory and a second memory,
wherein the memory access operations include a main memory access operation for a current processing operation that is performed by the processing unit, and a standby memory access operation for a standby processing operation that is performed by the processing unit after the current processing operation,
wherein a level of the first memory is equal to a level of the processing unit, and a level of the second memory is different from the level of the first memory, and
wherein the LSU comprises:
a main LSU configured to perform the main memory access operation between the first memory and the second memory; and
a hidden LSU configured to perform the standby memory access operation between the first memory and the second memory.
|