US 11,768,911 B2
Method and apparatus for execution of neural network
Yang Jiao, San Jose, CA (US); and Yijung Su, San Jose, CA (US)
Assigned to Alibaba Group Holding Limited, Grand Cayman (KY)
Filed by ALIBABA GROUP HOLDING LIMITED, Grand Cayman (KY)
Filed on Aug. 26, 2020, as Appl. No. 17/3,354.
Claims priority of provisional application 62/904,953, filed on Sep. 24, 2019.
Prior Publication US 2021/0089611 A1, Mar. 25, 2021
Int. Cl. G06F 9/46 (2006.01); G06F 17/15 (2006.01); G06F 9/48 (2006.01); G06N 3/04 (2023.01)
CPC G06F 17/153 (2013.01) [G06F 9/463 (2013.01); G06F 9/4831 (2013.01); G06F 9/4881 (2013.01); G06N 3/04 (2013.01)]
20 Claims
 
1. A method implemented by a processing unit, the processing unit comprising a command parser having circuitry configured to dispatch commands and computing tasks and at least one core communicatively coupled with the command parser and configured to process the dispatched computing task, each core comprising a convolution unit, a pooling unit, at least one operation unit and a sequencer communicatively coupled with the convolution unit, the pooling unit, and the at least one operation unit and having circuitry configured to distribute instructions of the dispatched computing task to the convolution unit, the pooling unit, and the at least one operation unit for execution, the sequencer further comprising circuitry configured to modify the instructions of the core; the method comprising:
reading, by the convolution unit, data from a local memory of the at least one operation unit;
performing, by the convolution unit, a convolution operation on the data to generate a feature map; and
performing, by the pooling unit, a pooling operation on the feature map.
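The three steps of the claimed method (read, convolve, pool) can be illustrated with a small software model. This is only a sketch under stated assumptions, not the patented hardware: the class names OperationUnit, ConvolutionUnit and PoolingUnit, the run_method helper, the "input" memory key, the stride-1 unpadded convolution, and the choice of max pooling are all hypothetical, since the claim does not specify them. The command parser and sequencer are omitted.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple

Matrix = List[List[int]]


@dataclass
class OperationUnit:
    """Hypothetical operation unit; the claim only says it has a local memory."""
    local_memory: Dict[str, Matrix] = field(default_factory=dict)


class ConvolutionUnit:
    def read(self, op_unit: OperationUnit, key: str) -> Matrix:
        # Step 1: read data from the operation unit's local memory.
        return op_unit.local_memory[key]

    def convolve(self, data: Matrix, kernel: Matrix) -> Matrix:
        # Step 2: stride-1, unpadded sliding-window sum of products
        # (an assumed convolution variant; the claim does not fix one).
        kh, kw = len(kernel), len(kernel[0])
        out_h = len(data) - kh + 1
        out_w = len(data[0]) - kw + 1
        return [
            [
                sum(data[i + a][j + b] * kernel[a][b]
                    for a in range(kh) for b in range(kw))
                for j in range(out_w)
            ]
            for i in range(out_h)
        ]


class PoolingUnit:
    def max_pool(self, fmap: Matrix, size: int) -> Matrix:
        # Step 3: non-overlapping max pooling (assumed pooling type); rows and
        # columns that do not fill a whole window are dropped.
        rows = len(fmap) // size
        cols = len(fmap[0]) // size
        return [
            [
                max(fmap[r * size + a][c * size + b]
                    for a in range(size) for b in range(size))
                for c in range(cols)
            ]
            for r in range(rows)
        ]


def run_method(op_unit: OperationUnit, conv: ConvolutionUnit,
               pool: PoolingUnit, kernel: Matrix,
               pool_size: int = 2) -> Tuple[Matrix, Matrix]:
    """Run the three claimed steps in order and return (feature map, pooled)."""
    data = conv.read(op_unit, "input")
    fmap = conv.convolve(data, kernel)
    pooled = pool.max_pool(fmap, pool_size)
    return fmap, pooled


if __name__ == "__main__":
    # 5x5 input holding the values 1..25 in row-major order.
    op = OperationUnit({"input": [[5 * r + c + 1 for c in range(5)]
                                  for r in range(5)]})
    fmap, pooled = run_method(op, ConvolutionUnit(), PoolingUnit(),
                              kernel=[[1, 1], [1, 1]])
    print(pooled)  # [[40, 48], [80, 88]]
```

In this model the convolution unit, rather than the operation unit, initiates the read, which mirrors the claim language that the data is read "by the convolution unit" from the operation unit's local memory.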