US 12,141,703 B2
Minimum deep learning with gating multiplier
Gil Shamir, Sewickley, PA (US)
Assigned to GOOGLE LLC, Mountain View, CA (US)
Filed by Google LLC, Mountain View, CA (US)
Filed on Sep. 14, 2023, as Appl. No. 18/467,207.
Application 18/467,207 is a division of application No. 16/809,096, filed on Mar. 4, 2020, granted, now 11,790,236.
Prior Publication US 2024/0005166 A1, Jan. 4, 2024
Int. Cl. G06N 3/084 (2023.01); G06N 3/04 (2023.01)
CPC G06N 3/084 (2013.01) [G06N 3/04 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A computer-implemented method of deploying a machine-learned model, the computer-implemented method comprising:
obtaining, by one or more computing devices, data descriptive of a neural network comprising one or more network units and one or more gating units, the one or more gating units comprising one or more gating paths associated with the one or more network units;
training, by the one or more computing devices, the neural network to learn one or more network parameters of the one or more network units and one or more gating parameters of the one or more gating units;
sparsifying, by the one or more computing devices, the neural network based at least in part on the one or more gating parameters of the one or more gating units to generate a sparsified neural network; and
deploying the sparsified neural network to perform inference.