| CPC G06F 9/5077 (2013.01) [G06F 9/5016 (2013.01); G06F 9/5033 (2013.01); G06F 9/5083 (2013.01); G06N 20/00 (2019.01)] | 18 Claims |

|
1. A method for dynamic virtual machine (VM) provisioning across cloud service providers, comprising:
transmitting instructions across computer networks to hypervisors executing on computers of the cloud service providers to provide a VM pool that includes a first graphics processing unit (GPU)-optimized VM and a first non-GPU-optimized VM;
receiving, at a control plane for the VM pool, an indication that a user of the first non-GPU-optimized VM has submitted a request through the first non-GPU-optimized VM, wherein the request specifies a workload;
determining, by the control plane, that the first GPU-optimized VM is available for using a GPU to execute the workload, and issuing, by the control plane upon said determining, an instruction to the first non-GPU-optimized VM to send the workload to the first GPU-optimized VM; and
transmitting an instruction across a computer network to a computer of one of the cloud service providers to instantiate a second GPU-optimized VM to increase the amount of GPU-optimized VMs of the VM pool that are available for using GPUs to execute workloads, to a minimum number.
|