CPC G06N 20/20 (2019.01) [G06F 40/126 (2020.01); G06F 40/284 (2020.01)] | 20 Claims |
1. A computer-implemented method comprising:
receiving a first input data of a first task type during a runtime, the first task type indicating first processing to be performed by one or more application instances during the same runtime;
dynamically generating a first model tuned to generate predictions for the first task type, the generating comprising integrating, into a previously-trained base model during the same runtime, a first model artifact comprising one or more adapter layers specific to the first task type;
generating, during the same runtime and based on processing the first input data with the first model generated during the same runtime, a prediction for the first input data;
distributing the prediction to the one or more application instances during the same runtime, thereby enabling the first processing;
receiving a second input data of a second task type during the same runtime, the second task type being different from the first task type and indicating second processing to be performed by the one or more application instances during the same runtime;
generating, during the same runtime, a second model tuned to generate predictions for the second task type, the generating comprising dynamically exchanging the first model artifact with a second model artifact comprising one or more adapter layers specific to the second task type;
generating, during the same runtime, a second prediction for the second input data by processing the second input data using the second model; and
distributing the second prediction to the one or more application instances during the same runtime, thereby enabling the second processing.
|