| CPC G06F 9/544 (2013.01) [G06F 9/461 (2013.01); G06F 9/4812 (2013.01); G06F 9/4881 (2013.01); G06F 9/5027 (2013.01); G06F 9/5066 (2013.01); G06F 2209/486 (2013.01); G06F 2209/5017 (2013.01)] | 18 Claims |

|
1. An apparatus comprising:
one or more neural processors configured to perform neural network model tasks;
a command processor configured to distribute neural network model tasks to the one or more neural processors; and
a shared memory shared by the one or more neural processors,
wherein the command processor is configured to cause:
receiving a context start signal indicating a start of a context of a neural network model from a host system;
determining whether neural network model data is entirely or partially updated based on the context start signal;
based on a determination that neural network model data is entirely updated, directly accessing a memory in the host system to read one or more context descriptors and to store the one or more context descriptors in the shared memory, and directly accessing the memory in the host system based on the one or more context descriptors to read the entire neural network model data and to update the neural network model data in the shared memory with the entire neural network model data;
based on a determination that neural network model data is partially updated, directly accessing the memory in the host system based on the context start signal to read partial neural network model data and to update the neural network model data in the shared memory with the partial neural network model data;
generating a plurality of task descriptors describing neural network model tasks based on the neural network model data; and
distributing the plurality of task descriptors to the one or more neural processors.
|
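The control flow recited in claim 1 can be illustrated with a minimal sketch. It is not the claimed implementation. Every name in it is a hypothetical chosen for illustration: `ContextStartSignal`, its fields, the dictionary standing in for host memory, the one-task-per-entry descriptor generation, and the round-robin distribution. The claim itself specifies only that the command processor decides between a full and a partial update from the context start signal, reads host memory directly, updates the shared memory, and then generates and distributes task descriptors.

```python
from dataclasses import dataclass, field
from enum import Enum


class UpdateKind(Enum):
    FULL = "full"
    PARTIAL = "partial"


@dataclass
class ContextStartSignal:
    # Hypothetical layout; the claim does not define the signal's fields.
    update_kind: UpdateKind
    descriptor_addr: int = 0   # host address of the context descriptors (full update)
    partial_addr: int = 0      # host address of the partial model data
    partial_key: str = ""      # which entry of the model data to replace


@dataclass
class CommandProcessor:
    host_memory: dict          # stands in for host memory reached by direct access
    num_neural_processors: int
    shared_memory: dict = field(default_factory=dict)
    queues: list = field(default_factory=list)

    def __post_init__(self):
        # One task queue per neural processor (assumed distribution mechanism).
        self.queues = [[] for _ in range(self.num_neural_processors)]
        self.shared_memory.setdefault("model_data", {})

    def dma_read(self, addr):
        # Models a direct read of host memory without host CPU involvement.
        return self.host_memory[addr]

    def on_context_start(self, signal):
        if signal.update_kind is UpdateKind.FULL:
            # Full update: fetch the context descriptors first, store them in
            # shared memory, then use them to fetch the entire model data.
            descriptors = self.dma_read(signal.descriptor_addr)
            self.shared_memory["context_descriptors"] = descriptors
            self.shared_memory["model_data"] = {
                d["name"]: self.dma_read(d["addr"]) for d in descriptors
            }
        else:
            # Partial update: the signal itself locates the partial data,
            # so no context descriptors are read.
            self.shared_memory["model_data"][signal.partial_key] = self.dma_read(
                signal.partial_addr
            )

        # Generate task descriptors from the model data (one per entry, assumed).
        tasks = [
            {"task_id": i, "layer": name}
            for i, name in enumerate(self.shared_memory["model_data"])
        ]
        # Distribute round-robin across the neural processors (assumed policy).
        for task in tasks:
            self.queues[task["task_id"] % self.num_neural_processors].append(task)
        return tasks
```

In this sketch a full update replaces the model data wholesale, while a partial update overwrites a single entry and leaves the rest of the shared memory untouched, which mirrors the two branches of the claim.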