US 12,112,560 B2
Usage based resource utilization of training pool for chatbots
Xin Xu, San Jose, CA (US); Suman Mallapura Somasundar, Sunnyvale, CA (US); Vishal Vishnoi, Redwood City, CA (US); Xinwei Zhang, Redwood City, CA (US); and Ping L. Lin, Foster City, CA (US)
Assigned to Oracle International Corporation, Redwood Shores, CA (US)
Filed by Oracle International Corporation, Redwood Shores, CA (US)
Filed on Jan. 7, 2022, as Appl. No. 17/570,804.
Claims priority of provisional application 63/139,567, filed on Jan. 20, 2021.
Prior Publication US 2022/0230462 A1, Jul. 21, 2022
Int. Cl. G06V 30/19 (2022.01); H04L 51/02 (2022.01)
CPC G06V 30/19113 (2022.01) [G06V 30/19147 (2022.01); H04L 51/02 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A method comprising:
receiving a training request, the training request including a first identifier indicative of a type of a machine learning model that is to be trained;
maintaining a plurality of training jobs in a queue of training jobs;
maintaining a plurality of workers in a training pool, wherein each worker is configured to train a particular type of machine learning model; and
responsive to the training request being validated:
creating a training job associated with the training request;
submitting the training job to the queue of training jobs;
obtaining, for each type of machine learning model, a first metric corresponding to a first number of the type of machine learning models included in the queue of training jobs;
obtaining, for each type of machine learning model, a second metric corresponding to a second number of workers included in the training pool, wherein the second number of workers are programmed to train the type of machine learning model;
computing a target metric based on the first metric and the second metric; and
modifying the second number of workers included in the training pool based on the target metric.