| CPC G06F 9/547 (2013.01) [G06F 9/45558 (2013.01); G06N 20/00 (2019.01); H04L 67/133 (2022.05); G06F 2009/45562 (2013.01); G06F 2009/45591 (2013.01); G06F 2009/45595 (2013.01)] | 20 Claims |

|
1. A method, comprising:
receiving a request, from a client device, the request comprising input data for a desired machine learning model;
selecting an execution environment with a loaded instance of the desired machine learning model by:
identifying one or more active virtual machines that are associated with an application programming interface (API) endpoint;
determining a virtual machine of the one or more active virtual machines with an available execution environment that has the loaded instance of the desired machine learning model; and
selecting the available execution environment to be the execution environment; and
instructing the API endpoint to forward the input data to the selected execution environment to obtain output data that is returned to the client device.
|