US 12,277,451 B2
Dynamic GPU-enabled virtual machine provisioning across cloud providers
Sam Zhao, Beijing (CN); Bomin Nie, Beijing (CN); Nan Wang, Beijing (CN); Jingtao Zhang, Beijing (CN); and Zhifu Jin, Beijing (CN)
Assigned to Omnissa, LLC, Mountain View, CA (US)
Filed by Omnissa, LLC, Mountain View, CA (US)
Filed on Feb. 21, 2022, as Appl. No. 17/676,397.
Claims priority of application No. PCT/CN2022/073240 (WO), filed on Jan. 21, 2022.
Prior Publication US 2023/0236902 A1, Jul. 27, 2023
Int. Cl. G06F 9/50 (2006.01); G06N 20/00 (2019.01)
CPC G06F 9/5077 (2013.01) [G06F 9/5016 (2013.01); G06F 9/5033 (2013.01); G06F 9/5083 (2013.01); G06N 20/00 (2019.01)]
18 Claims
 
1. A method for dynamic virtual machine (VM) provisioning across cloud service providers, comprising:
transmitting instructions across computer networks to hypervisors executing on computers of the cloud service providers to provide a VM pool that includes a first graphics processing unit (GPU)-optimized VM and a first non-GPU-optimized VM;
receiving, at a control plane for the VM pool, an indication that a user of the first non-GPU-optimized VM has submitted a request through the first non-GPU-optimized VM, wherein the request specifies a workload;
determining, by the control plane, that the first GPU-optimized VM is available for using a GPU to execute the workload, and issuing, by the control plane upon said determining, an instruction to the first non-GPU-optimized VM to send the workload to the first GPU-optimized VM; and
transmitting an instruction across a computer network to a computer of one of the cloud service providers to instantiate a second GPU-optimized VM to increase the amount of GPU-optimized VMs of the VM pool that are available for using GPUs to execute workloads, to a minimum number.
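The steps of claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `VM`, `ControlPlane`, `handle_request`, `replenish`, and `min_available_gpu_vms` are hypothetical, the provider labels are placeholders, and network transmission to hypervisors is modeled only as strings appended to an `instructions` list. The sketch assumes that a GPU-optimized VM becomes unavailable once a workload is routed to it, and that the pool is then topped up until the number of available GPU-optimized VMs reaches the minimum.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class VM:
    name: str
    provider: str          # hypothetical cloud service provider label
    gpu_optimized: bool
    busy: bool = False     # assumption: a busy VM is not "available"


@dataclass
class ControlPlane:
    pool: List[VM]
    min_available_gpu_vms: int
    default_provider: str = "provider-a"   # placeholder provider
    # Stand-in for instructions sent across the network.
    instructions: List[str] = field(default_factory=list)

    def available_gpu_vms(self) -> List[VM]:
        """GPU-optimized VMs in the pool that can accept a workload."""
        return [vm for vm in self.pool if vm.gpu_optimized and not vm.busy]

    def handle_request(self, source: VM, workload: str) -> Optional[VM]:
        """Route a workload submitted through a non-GPU-optimized VM."""
        available = self.available_gpu_vms()
        target = available[0] if available else None
        if target is not None:
            target.busy = True
            # Instruct the source VM to send the workload to the target.
            self.instructions.append(
                f"{source.name}: send '{workload}' to {target.name}"
            )
        self.replenish()
        return target

    def replenish(self) -> None:
        """Instantiate GPU-optimized VMs until the minimum is available."""
        while len(self.available_gpu_vms()) < self.min_available_gpu_vms:
            index = sum(vm.gpu_optimized for vm in self.pool) + 1
            new_vm = VM(f"gpu-vm-{index}", self.default_provider, True)
            self.pool.append(new_vm)
            self.instructions.append(
                f"{new_vm.provider}: instantiate {new_vm.name}"
            )


if __name__ == "__main__":
    gpu = VM("gpu-vm-1", "provider-a", gpu_optimized=True)
    desktop = VM("desktop-vm-1", "provider-b", gpu_optimized=False)
    plane = ControlPlane(pool=[gpu, desktop], min_available_gpu_vms=1)
    plane.handle_request(desktop, "render job")
    for line in plane.instructions:
        print(line)
```

In this sketch the replenishment check runs after every request, so the pool returns to the minimum as soon as a GPU-optimized VM is taken. A real control plane could instead replenish on a schedule or from a usage forecast; the claim itself does not fix that choice.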