CPC G06F 9/4881 (2013.01) [G06F 9/5038 (2013.01); G06F 16/9024 (2019.01); G06N 3/04 (2013.01); G06N 3/08 (2013.01)] | 25 Claims |
1. A computing system, comprising:
a processor; and
a memory coupled to the processor to store instructions which, when executed by the processor, cause the processor to:
partition a graph into a plurality of clusters comprising batched clusters that support batched data and non-batched clusters that fail to support batched data;
establish an execution queue for execution of the plurality of clusters based on cluster dependencies; and
schedule inference execution of the plurality of clusters in the execution queue based on batch size.
|