| CPC G06N 3/084 (2013.01) [G06N 3/04 (2013.01)] | 20 Claims |

|
1. A computer-implemented method of deploying a machine-learned model, the computer-implemented method comprising:
obtaining, by one or more computing devices, data descriptive of a neural network comprising one or more network units and one or more gating units, the one or more gating units comprising one or more gating paths associated with the one or more network units;
training, by the one or more computing devices, the neural network to learn one or more network parameters of the one or more network units and one or more gating parameters of the one or more gating units;
sparsifying, by the one or more computing devices, the neural network based at least in part on the one or more gating parameters of the one or more gating units to generate a sparsified neural network; and
deploying the sparsified neural network to perform inference.
|
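The four steps recited in claim 1 (obtaining, training, sparsifying, deploying) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the claim: each network unit is a single weight on one input, each gating path is a scalar gate multiplying that unit's output, training uses plain stochastic gradient descent with an L1 penalty on the gates, and sparsification drops units whose gate magnitude falls below a threshold. The names `gated_forward`, `train`, `sparsify`, `infer`, and `make_data` are hypothetical.

```python
import random


def sign(v):
    """Return -1, 0, or 1 according to the sign of v."""
    return (v > 0) - (v < 0)


def gated_forward(x, weights, gates):
    """Network unit j computes weights[j] * x[j]; its gating path scales it by gates[j]."""
    return sum(g * w * xi for xi, w, g in zip(x, weights, gates))


def make_data(n_samples=50, n_units=4, seed=0):
    """Toy data (assumption): the target depends only on inputs 0 and 2."""
    rng = random.Random(seed)
    data = []
    for _ in range(n_samples):
        x = [rng.uniform(-1.0, 1.0) for _ in range(n_units)]
        data.append((x, 2.0 * x[0] - 3.0 * x[2]))
    return data


def train(data, n_units, epochs=200, lr=0.05, l1=0.01, seed=0):
    """Jointly learn network parameters (weights) and gating parameters (gates).

    The L1 term on the gates is an illustrative choice that nudges the gates
    of unhelpful units toward zero.
    """
    rng = random.Random(seed)
    weights = [rng.uniform(-0.5, 0.5) for _ in range(n_units)]
    gates = [1.0] * n_units
    for _ in range(epochs):
        for x, y in data:
            err = gated_forward(x, weights, gates) - y
            for j in range(n_units):
                # Gradients of squared error (plus L1 on the gate) for unit j.
                grad_w = 2.0 * err * gates[j] * x[j]
                grad_g = 2.0 * err * weights[j] * x[j] + l1 * sign(gates[j])
                weights[j] -= lr * grad_w
                gates[j] -= lr * grad_g
    return weights, gates


def sparsify(weights, gates, threshold=0.1):
    """Keep units whose gate magnitude reaches the threshold; fold each gate into its weight."""
    kept = [j for j, g in enumerate(gates) if abs(g) >= threshold]
    folded = [weights[j] * gates[j] for j in kept]
    return kept, folded


def infer(x, kept, folded):
    """Inference with the sparsified network: only the kept units are evaluated."""
    return sum(w * x[j] for j, w in zip(kept, folded))


if __name__ == "__main__":
    data = make_data()                      # stands in for "obtaining" the network and data
    weights, gates = train(data, n_units=4) # training
    kept, folded = sparsify(weights, gates) # sparsifying
    print("gates:", [round(g, 3) for g in gates])
    print("kept units:", kept)
    print("prediction:", infer([1.0, 0.0, 1.0, 0.0], kept, folded))  # deployed inference
```

Folding each surviving gate into its weight means the deployed model carries no gating parameters at inference time; this is one possible design choice among several and is not required by the claim language.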