US 12,106,154 B2
Serverless computing architecture for artificial intelligence workloads on edge for dynamic reconfiguration of workloads and enhanced resource utilization
Yamini Nimmagadda, Portland, OR (US); Akhila Vidiyala, Beaverton, OR (US); Suryaprakash Shanmugam, Santa Clara, CA (US); and Divya Prakash, Santa Clara, CA (US)
Assigned to Intel Corporation, Santa Clara, CA (US)
Filed by Intel Corporation, Santa Clara, CA (US)
Filed on Aug. 19, 2021, as Appl. No. 17/406,711.
Prior Publication US 2021/0382754 A1, Dec. 9, 2021
Int. Cl. G06F 9/46 (2006.01); G06F 9/50 (2006.01); G06N 3/08 (2023.01)
CPC G06F 9/505 (2013.01) [G06F 9/5016 (2013.01); G06F 9/5072 (2013.01); G06N 3/08 (2013.01)] 25 Claims
OG exemplary drawing
 
1. A computing system comprising:
a network controller to communicate with one or more nodes including hardware devices;
a processor coupled to the network controller; and
a memory coupled to the processor, the memory including a set of executable program instructions, which when executed by the processor, cause the processor to:
analyze an input stream and an artificial intelligence (AI) model graph to generate a workload characterization, wherein the workload characterization is to characterize one or more of compute resources or memory resources, wherein the one or more of the compute resources or the memory resources is associated with execution of the AI model graph based on the input stream;
partition the AI model graph into subgraphs based on the workload characterization; and
select a plurality of the hardware devices to execute the subgraphs.