US 11,954,584 B2
Neural core, neural processing device including same, and method for loading data of neural processing device
Jinseok Kim, Seongnam-si (KR); Kyeongryeol Bong, Seongnam-si (KR); Jinwook Oh, Seongnam-si (KR); and Yoonho Boo, Seongnam-si (KR)
Assigned to Rebellions Inc., Seongnam-si (KR)
Filed by Rebellions Inc., Seongnam-si (KR)
Filed on May 23, 2023, as Appl. No. 18/322,519.
Claims priority of application No. 10-2022-0084478 (KR), filed on Jul. 8, 2022.
Prior Publication US 2024/0013038 A1, Jan. 11, 2024
Int. Cl. G06F 9/54 (2006.01); G06N 3/063 (2023.01); G06F 12/084 (2016.01)
CPC G06N 3/063 (2013.01) [G06F 9/544 (2013.01); G06F 12/084 (2013.01)] 18 Claims
OG exemplary drawing
 
1. A neural core comprising:
a processing unit configured to perform operations;
an L0 memory configured to store input data; and
a load/store unit (LSU) configured to perform a load task and a store task of data between the processing unit and the L0 memory,
wherein the LSU comprises a local memory load unit configured to transmit the input data in the L0 memory to the processing unit, and
the local memory load unit comprises:
a target decision module configured to identify and retrieve the input data in the L0 memory;
a transformation logic configured to transform the input data and thereby generate transformed data by performing a merge operation to transform an order of data granules of the input data, and wherein:
the input data comprises first and second input data,
the transformed data comprises first and second transformed data,
the first input data comprises first and second data granules of same size each other,
the second input data comprises third and fourth data granules of same size each other,
the first transformed data comprises the first and third data granules, and
the second transformed data comprises the second and fourth data granules; and
an output first in first out (FIFO) configured to receive the transformed data and transmit the transformed data to the processing unit in the received order.