CPC G06F 8/443 (2013.01) [G06F 8/451 (2013.01); G06N 3/10 (2013.01)] | 19 Claims |
1. A processor-implemented compile method for a neural network, the compile method comprising:
receiving data related to the neural network;
generating a grouped layer by grouping layers comprised in the neural network based on the data;
generating a set of passes executable in parallel based on a dependency between a plurality of passes to process the neural network;
generating a set of threads performing a plurality of optimization functions based on whether optimization operations performed by the optimization functions is performed independently for the layers, respectively, or sequentially based on a dependency between the layers; and
performing compilation in parallel based on the grouped layer, the set of passes, and the set of threads.
|