US 12,430,181 B2
Method and apparatus for partitioning neural network data
Hanwoong Jung, Seoul (KR); Joonho Song, Hwaseong-si (KR); and Seungwon Lee, Hwaseong-si (KR)
Assigned to Samsung Electronics Co., Ltd., Suwon-si (KR)
Filed by SAMSUNG ELECTRONICS CO., LTD., Suwon-si (KR)
Filed on Sep. 7, 2021, as Appl. No. 17/467,890.
Claims priority of application No. 10-2020-0167681 (KR), filed on Dec. 3, 2020.
Prior Publication US 2022/0179714 A1, Jun. 9, 2022
Int. Cl. G06F 9/48 (2006.01); G06F 9/50 (2006.01); G06N 3/063 (2023.01)
CPC G06F 9/5061 (2013.01) [G06F 9/4881 (2013.01); G06N 3/063 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A processor-implemented method, comprising:
dividing an input feature map or weight, respectively of a layer of a neural network, into partitions to be assigned to cores in different ways depending on whether an output of a previous layer is forwardable;
generating tiles by dividing each one of the partitions; and
scheduling an operation order of the tiles based on whether values of the input feature map or the weight are shared between the cores.