US 12,380,322 B2
Method and apparatus with neural network operation
Dongwook Lee, Suwon-si (KR); Changyong Son, Anyang-si (KR); Jinwoo Son, Seoul (KR); Jaehyoung Yoo, Seongnam-si (KR); and Jaejoon Han, Seoul (KR)
Assigned to Samsung Electronics Co., Ltd., Suwon-si (KR)
Filed by SAMSUNG ELECTRONICS CO., LTD., Suwon-si (KR)
Filed on Apr. 30, 2021, as Appl. No. 17/245,144.
Claims priority of application No. 10-2020-0122313 (KR), filed on Sep. 22, 2020.
Prior Publication US 2022/0092394 A1, Mar. 24, 2022
Int. Cl. G06N 3/063 (2023.01); G06F 9/50 (2006.01); G06N 3/0464 (2023.01)
CPC G06N 3/063 (2013.01) [G06F 9/5016 (2013.01); G06N 3/0464 (2023.01)] 30 Claims
OG exemplary drawing
 
1. A processor-implemented method, the method comprising:
generating a first intermediate operation result by performing a first-order partial operation of a neural network layer on a first input line of a first area of a channel of a frame, and storing the first intermediate operation result in a buffer;
generating a second intermediate operation result by performing another first-order partial operation of the neural network layer on a second input line of the first area;
generating an objective operation result of the neural network layer with respect to the first area based on a second-order partial operation performed on the first intermediate operation result and the second intermediate operation result;
reusing the buffer to store an intermediate operation result with respect to another area of the frame after the objective operation result is generated; and
completing, before all pixel values of the frame are received, respective objective operation results of plural subsequent neural network layers dependent on the generated objective operation result of the neural network layer with respect to the first are a,
wherein the neural network layer and the plural subsequent neural network layers are layers of a same neural network.