CPC G06F 13/4068 (2013.01) [G06N 3/04 (2013.01)] | 19 Claims |
1. A network-on-chip (NoC) processing system, comprising a storage device and a plurality of computation device clusters, wherein the storage device and the plurality of computation device clusters are arranged on a same chip, each computation device cluster includes a plurality of computation devices, wherein at least one of the plurality of computation device clusters is connected to the storage device, and at least two computation device clusters are connected to each other,
wherein at least one computation device of the plurality of computation device clusters is configured to perform a machine learning computation, and the computation device includes an operation unit and a controller unit, wherein the operation unit includes a primary processing circuit and a plurality of secondary processing circuits, wherein
the controller unit is configured to obtain input data and a computation instruction;
the controller unit is further configured to parse the computation instruction to obtain a plurality of operation instructions, and send the plurality of operation instructions and the input data to the primary processing circuit;
the primary processing circuit is configured to perform preorder processing on the input data, and send the data and the operation instructions among the primary processing circuit and the plurality of secondary processing circuits;
the plurality of secondary processing circuits are configured to perform intermediate computations in parallel according to the data and the operation instructions sent by the primary processing circuit to obtain a plurality of intermediate results, and send the plurality of intermediate results to the primary processing circuit; and
the primary processing circuit is further configured to perform postorder processing on the plurality of intermediate results to obtain a computation result of the computation instruction.
|