| CPC G10L 15/065 (2013.01) [G10L 15/01 (2013.01); G10L 15/063 (2013.01); G10L 15/18 (2013.01); G10L 15/26 (2013.01); G10L 15/30 (2013.01)] | 20 Claims |

|
11. A system comprising:
data processing hardware; and
memory hardware in communication with the data processing hardware and storing instructions that when executed on the data processing hardware cause the data processing hardware to perform the operations comprising:
receiving a distilled model identified for execution on a target client device, the distilled model related to a corresponding cloud-based model;
receiving at least one of memory constraints or processing constraints of the target client device;
selecting a model configuration for the distilled model based on the at least one of the memory constraints or the processing constraints of the target client device;
processing, using the distilled model having the selected model configuration, an evaluation data set to generate evaluation results indicating an accuracy of the distilled model; and
based on the evaluation results indicating the accuracy of the distilled model, deploying the distilled model having the selected model configuration to the target client device.
|
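The operations recited in claim 11 can be illustrated with a minimal sketch. It is not the claimed implementation: every name here (`DeviceConstraints`, `ModelConfig`, `select_config`, `evaluate`, `maybe_deploy`) is hypothetical. Choosing the largest configuration that fits, and gating deployment on a fixed `min_accuracy` threshold, are both assumptions. The claim only requires that selection be based on the constraints and that deployment be based on the evaluation results.

```python
from dataclasses import dataclass
from typing import Any, Callable, Optional, Sequence, Tuple


@dataclass(frozen=True)
class DeviceConstraints:
    """Hypothetical constraints of the target client device.

    The claim requires at least one of the two, so both are optional.
    """
    max_memory_mb: Optional[float] = None
    max_gflops: Optional[float] = None


@dataclass(frozen=True)
class ModelConfig:
    """Hypothetical description of one distilled-model configuration."""
    name: str
    memory_mb: float
    gflops: float


def select_config(
    configs: Sequence[ModelConfig], constraints: DeviceConstraints
) -> Optional[ModelConfig]:
    """Return the largest configuration that fits the constraints (assumed policy)."""

    def fits(cfg: ModelConfig) -> bool:
        mem_ok = (constraints.max_memory_mb is None
                  or cfg.memory_mb <= constraints.max_memory_mb)
        cpu_ok = (constraints.max_gflops is None
                  or cfg.gflops <= constraints.max_gflops)
        return mem_ok and cpu_ok

    candidates = [cfg for cfg in configs if fits(cfg)]
    if not candidates:
        return None
    return max(candidates, key=lambda cfg: cfg.memory_mb)


def evaluate(predict: Callable[[Any], Any],
             dataset: Sequence[Tuple[Any, Any]]) -> float:
    """Return the fraction of (input, expected) pairs the model gets right."""
    if not dataset:
        raise ValueError("evaluation data set is empty")
    correct = sum(1 for x, expected in dataset if predict(x) == expected)
    return correct / len(dataset)


def maybe_deploy(
    configs: Sequence[ModelConfig],
    constraints: DeviceConstraints,
    build_model: Callable[[ModelConfig], Callable[[Any], Any]],
    dataset: Sequence[Tuple[Any, Any]],
    min_accuracy: float,
    deploy: Callable[[ModelConfig], None],
) -> bool:
    """Select a configuration, evaluate it, and deploy only if accurate enough."""
    config = select_config(configs, constraints)
    if config is None:
        return False  # nothing fits the target device
    predict = build_model(config)  # distilled model with the selected configuration
    accuracy = evaluate(predict, dataset)
    if accuracy >= min_accuracy:  # assumed acceptance rule
        deploy(config)
        return True
    return False
```

In this sketch, `build_model` and `deploy` are caller-supplied callbacks standing in for whatever mechanism instantiates the distilled model and pushes it to the target client device. The claim does not specify either.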