US 12,321,778 B1
Preemptive scheduling for serverless high performance computing
Tiago Costa Tomas, Zaandam (NL)
Assigned to Amazon Technologies, Inc., Seattle, WA (US)
Filed by Amazon Technologies, Inc., Seattle, WA (US)
Filed on Sep. 14, 2021, as Appl. No. 17/474,963.
Int. Cl. G06F 9/46 (2006.01); G06F 9/48 (2006.01); G06F 9/50 (2006.01)
CPC G06F 9/4881 (2013.01) [G06F 9/4825 (2013.01); G06F 9/505 (2013.01)] 20 Claims
 
1. A system, comprising:
a cloud provider network providing elastic computing capacity under a utility computing model;
a serverless high performance computing (HPC) infrastructure implemented in the cloud provider network and including a plurality of clusters and at least one cluster group to which the plurality of clusters are individually assigned; and
at least one computing device configured to at least:
schedule a plurality of workloads for execution by the serverless HPC infrastructure;
schedule a scheduled interruption of a particular workload of the plurality of workloads, based on one or more workload criteria, wherein the one or more workload criteria comprise a dependency between the particular workload and at least one additional workload in the plurality of workloads;
interrupt the execution of the particular workload of the plurality of workloads according to the scheduled interruption;
transfer the particular workload from a first machine instance in a particular cluster of the plurality of clusters to a second machine instance in the particular cluster; and
resume the execution of the particular workload by the second machine instance.
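
The sequence recited in claim 1 (schedule, schedule an interruption, interrupt, transfer within the same cluster, resume) can be illustrated with a minimal in-memory sketch. This is not the patented implementation. Every name below (Workload, Cluster, Scheduler, the tick-based clock, the integer progress counter standing in for saved state) is hypothetical, and the rule that a dependency on another scheduled workload is what qualifies a workload for interruption is an assumption made only for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Workload:
    name: str
    depends_on: List[str] = field(default_factory=list)  # names of other workloads
    progress: int = 0               # stands in for checkpointed execution state
    state: str = "pending"          # pending, running, or interrupted
    instance: Optional[str] = None  # machine instance currently hosting the workload


@dataclass
class Cluster:
    name: str
    group: str            # cluster group to which this cluster is assigned
    instances: List[str]  # machine instances available in this cluster


class Scheduler:
    """Hypothetical scheduler for a single cluster (illustration only)."""

    def __init__(self, cluster: Cluster) -> None:
        self.cluster = cluster
        self.workloads: Dict[str, Workload] = {}
        self.interruptions: Dict[str, int] = {}  # workload name -> tick

    def _check_instance(self, instance: str) -> None:
        if instance not in self.cluster.instances:
            raise ValueError(f"{instance} is not in cluster {self.cluster.name}")

    def schedule(self, workload: Workload, instance: str) -> None:
        """Schedule a workload for execution on an instance of the cluster."""
        self._check_instance(instance)
        workload.instance = instance
        workload.state = "running"
        self.workloads[workload.name] = workload

    def schedule_interruption(self, name: str, at_tick: int) -> bool:
        """Schedule an interruption if the assumed dependency criterion holds."""
        workload = self.workloads[name]
        if any(dep in self.workloads for dep in workload.depends_on):
            self.interruptions[name] = at_tick
            return True
        return False

    def run_tick(self, now: int, target_instance: str) -> List[str]:
        """Advance one tick; migrate workloads whose interruption is due now."""
        self._check_instance(target_instance)  # transfer stays inside the cluster
        moved: List[str] = []
        for name, workload in self.workloads.items():
            if self.interruptions.get(name) == now:
                workload.state = "interrupted"       # interrupt per the schedule
                workload.instance = target_instance  # transfer to second instance
                workload.state = "running"           # resume on the new instance
                del self.interruptions[name]
                moved.append(name)
            elif workload.state == "running":
                workload.progress += 1               # ordinary execution step
        return moved


cluster = Cluster(name="c1", group="g1", instances=["i-1", "i-2"])
scheduler = Scheduler(cluster)
a = Workload("a")
b = Workload("b", depends_on=["a"])
scheduler.schedule(a, "i-1")
scheduler.schedule(b, "i-1")
scheduler.schedule_interruption("b", at_tick=2)
for tick in (1, 2, 3):
    scheduler.run_tick(tick, target_instance="i-2")
```

In this sketch workload b, which depends on a, is moved from i-1 to i-2 at tick 2 and keeps the progress it had accumulated, while a continues undisturbed on i-1. Validating the target instance against the cluster's own instance list mirrors the claim's requirement that both machine instances belong to the same particular cluster.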