CPC G06F 9/547 (2013.01) [G06F 9/45558 (2013.01); G06N 20/00 (2019.01); H04L 67/133 (2022.05); G06F 2009/45562 (2013.01); G06F 2009/45591 (2013.01); G06F 2009/45595 (2013.01)] | 20 Claims |
1. A method comprising:
instantiating, at each virtual machine of one or more virtual machines, a machine learning model execution environment for an instance of a machine learning model;
loading, by a processing device, a respective instance of the machine learning model to each machine learning model execution environment;
associating each loaded instance of the machine learning model with an application programming interface (API) endpoint, the API endpoint configured to receive input data for the loaded instance of the machine learning model from a client device and to return output data produced by the loaded instance of the machine learning model based on the input data;
receiving a request by the client device to configure the API endpoint; and
identifying configuration information specified by the request, wherein an identifier of the machine learning model and a resource locator of the API endpoint are specified by the configuration information.
|
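As an illustration only, the steps recited in claim 1 can be sketched in Python. This is a minimal in-memory sketch, not the claimed implementation. Every name in it (`ExecutionEnvironment`, `Endpoint`, `load_model`, `handle_configuration_request`, the request keys `model_id` and `endpoint_url`, and the example URL) is a hypothetical stand-in that does not come from the claim. Virtual machines are represented by plain string identifiers, and the "model" is a trivial function that sums its inputs. The sketch parses the configuration request first; that ordering is an assumption made for convenience and is not taken from the claim.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional

# Hypothetical model type: a callable mapping input data to output data.
Model = Callable[[List[float]], float]


@dataclass
class ExecutionEnvironment:
    """Stand-in for a model execution environment on one virtual machine."""
    vm_id: str
    model: Optional[Model] = None  # populated when an instance is loaded


@dataclass
class Endpoint:
    """Stand-in for an API endpoint associated with loaded model instances."""
    url: str
    model_id: str
    environments: List[ExecutionEnvironment]

    def invoke(self, input_data: List[float]) -> float:
        # Route to the first environment; a real system would balance load.
        env = self.environments[0]
        if env.model is None:
            raise RuntimeError("no model instance loaded")
        return env.model(input_data)


def load_model(model_id: str) -> Model:
    """Hypothetical loader: returns a toy model that sums its inputs."""
    return lambda xs: sum(xs)


def handle_configuration_request(
    request: Dict[str, str],
    vm_ids: List[str],
    registry: Dict[str, Endpoint],
) -> Endpoint:
    # Identify configuration information: model identifier and resource locator.
    model_id = request["model_id"]
    url = request["endpoint_url"]

    # Instantiate one execution environment per virtual machine.
    environments = [ExecutionEnvironment(vm_id) for vm_id in vm_ids]

    # Load a respective model instance into each environment.
    for env in environments:
        env.model = load_model(model_id)

    # Associate the loaded instances with the API endpoint.
    endpoint = Endpoint(url=url, model_id=model_id, environments=environments)
    registry[url] = endpoint
    return endpoint


if __name__ == "__main__":
    registry: Dict[str, Endpoint] = {}
    request = {
        "model_id": "demo-model",
        "endpoint_url": "https://example.invalid/models/demo",
    }
    endpoint = handle_configuration_request(request, ["vm-1", "vm-2"], registry)
    print(endpoint.invoke([1.0, 2.0, 3.0]))
```

In this sketch the registry dictionary keyed by URL plays the role of the association between loaded instances and the endpoint; a deployed system would presumably use an HTTP router and real virtual machine provisioning instead.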