CPC G06F 9/505 (2013.01) [G06F 9/5016 (2013.01); G06F 9/5072 (2013.01); G06N 3/08 (2013.01)] | 25 Claims |
1. A computing system comprising:
a network controller to communicate with one or more nodes including hardware devices;
a processor coupled to the network controller; and
a memory coupled to the processor, the memory including a set of executable program instructions, which when executed by the processor, cause the processor to:
analyze an input stream and an artificial intelligence (AI) model graph to generate a workload characterization, wherein the workload characterization is to characterize one or more of compute resources or memory resources, wherein the one or more of the compute resources or the memory resources is associated with execution of the AI model graph based on the input stream;
partition the AI model graph into subgraphs based on the workload characterization; and
select a plurality of the hardware devices to execute the subgraphs.
|