CPC G06N 3/063 (2013.01) [G06N 3/04 (2013.01)] | 22 Claims |
1. A device, comprising:
a main processing circuit; and
a plurality of basic processing circuits, at least a subset of the plurality of basic processing circuits being connected to the main processing circuit,
wherein:
the main processing circuit is configured to:
generate an input data block and a weight data block, at least one of the input data block or the weight data block containing data of a fixed point type;
partition the input data block into a plurality of basic data blocks;
distribute the plurality of basic data blocks to at least one of the subset of the plurality of basic processing circuits connected to the main processing circuit; and
broadcast the weight data block to the subset of the plurality of basic processing circuits connected to the main processing circuit;
the at least one of the subset of the plurality of basic processing circuits is configured to:
perform computations using the weight data block broadcasted to the basic processing circuit and one or more basic data blocks distributed to the basic processing circuit to obtain a computation result; and
transfer the computation result to the main processing circuit; and
the main processing circuit is configured to process the computation result to obtain an instruction result.
|