| CPC G06F 9/5027 (2013.01) | 20 Claims |

|
1. A computer-implemented method comprising:
receiving, from a device connected by a network to a content management system, workload data requesting execution of a task using a machine-learning model;
extracting, from the workload data, workload features defining characteristics of the task;
determining task routing metrics for a plurality of hardware environments hosted in respective network environments and a plurality of machine-learning models in respective network environments;
determining a historical quality metric indicating how one or more machine-learning models of the plurality of machine-learning models will execute the task based on one or more user feedback metrics; and
selecting, based on an output of a model selection machine-learning model that utilizes the historical quality metric, from the plurality of hardware environments and from the plurality of machine-learning models, a designated hardware environment and a designated machine-learning model for executing the task based on the workload features and the task routing metrics.
|
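
The steps recited in claim 1 can be illustrated with a minimal Python sketch. This is not the patented implementation. Every name here (`extract_features`, `historical_quality`, `selection_score`, `select`, the metric keys `latency_s` and `cost`, and the weights) is a hypothetical chosen for illustration. In particular, a hand-written linear score stands in for the claimed model selection machine-learning model, and a mean of user ratings stands in for the historical quality metric.

```python
from dataclasses import dataclass
from itertools import product


@dataclass(frozen=True)
class Candidate:
    """A (hardware environment, machine-learning model) pairing."""
    hardware: str
    model: str


def extract_features(workload: dict) -> dict:
    """Derive workload features describing the task (hypothetical feature set)."""
    prompt = workload.get("prompt", "")
    return {
        "task_type": workload.get("task_type", "unknown"),
        "prompt_words": len(prompt.split()),
    }


def historical_quality(feedback: dict, model: str) -> float:
    """Mean of past user ratings in [0, 1]; 0.5 is an assumed neutral default."""
    ratings = feedback.get(model, [])
    return sum(ratings) / len(ratings) if ratings else 0.5


def selection_score(features: dict, hw: dict, mdl: dict, quality: float) -> float:
    """Stand-in for the model selection model: a fixed linear trade-off.

    The weights 0.2 and 0.1 are arbitrary assumptions, not values from the source.
    """
    est_latency = hw["latency_s"] * (1 + features["prompt_words"] / 100)
    return quality - 0.2 * est_latency - 0.1 * mdl["cost"]


def select(workload: dict, hardware_metrics: dict, model_metrics: dict,
           feedback: dict) -> Candidate:
    """Pick the highest-scoring hardware/model pair for the workload."""
    features = extract_features(workload)
    best, best_score = None, float("-inf")
    # Iterate in sorted order so ties resolve deterministically.
    for hw, mdl in product(sorted(hardware_metrics), sorted(model_metrics)):
        quality = historical_quality(feedback, mdl)
        score = selection_score(features, hardware_metrics[hw],
                                model_metrics[mdl], quality)
        if score > best_score:
            best, best_score = Candidate(hw, mdl), score
    return best
```

In this sketch the routing metrics are kept per hardware environment and per model, and every pairing is scored jointly, which mirrors the claim's selection of both a designated hardware environment and a designated model in one step. A real system would presumably replace `selection_score` with a trained model that takes the same inputs.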