CPC G06F 9/445 (2013.01) [G06F 9/5027 (2013.01); G06F 16/1744 (2019.01); G06N 3/045 (2023.01); G06N 3/082 (2013.01)] | 20 Claims |
1. A method for loading multi neural network model comprising:
compiling at least two neural network models and generating at least two binary model files corresponding to the at least two neural network models;
taking one of the at least two binary model files as the basic model, calculating and recording the difference between each binary model file except the basic model in the at least two binary model files and the basic model using preset difference calculation method, and generating a differences file;
compressing the basic model and the differences file using a preset compression method, and generating an input file; and
inputting the input file in a neural network accelerator, decompressing the input file to obtain the basic model and the differences file, and loading the basic model and the differences file in the neural network accelerator.
|