| CPC G06F 9/4881 (2013.01) [G06F 17/16 (2013.01); G06N 3/04 (2013.01); G06N 3/08 (2013.01); G06N 3/10 (2013.01)] | 20 Claims |

|
1. A method of scheduling computational matrix operations for execution, the method comprising:
selecting a data sample batch for a particular neural network model;
creating a set of work queues for the data sample batch;
determining a number of active queues within the set of work queues to be allowed to be processed simultaneously;
setting a priority level for each computational matrix operation in the set of work queues;
determining if preemption will be allowed during the execution, the preemption allowing a first computational matrix operation to be paused for a second computational matrix operation in the set of work queues; and
executing a computational matrix operation on the data sample batch in the set of work queues.
|