CPC G06N 3/082 (2013.01) [G06F 18/2148 (2023.01); G06N 3/098 (2023.01); G06N 20/00 (2019.01)] | 30 Claims |
1. A method for training a machine learning model, comprising:
computing, using a first batch of training data, a first gradient tensor comprising a gradient for each parameter of a parameter tensor for a machine learning model;
identifying a first subset of gradients in the first gradient tensor based on evaluating each respective gradient of the first gradient tensor using a first gradient criteria; and
updating a first subset of parameters in the parameter tensor based on the first subset of gradients.
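The three steps recited in claim 1 can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the claim: the linear model, the mean-squared-error loss, the magnitude-threshold criterion, the learning rate, and all function names (`compute_gradients`, `select_gradients`, `sparse_update`, `train_step`) are hypothetical. The claim itself does not limit the gradient criteria to a threshold test.

```python
from typing import Callable, List, Sequence, Tuple

Example = Tuple[Sequence[float], float]  # (input features, target)


def compute_gradients(params: Sequence[float], batch: Sequence[Example]) -> List[float]:
    """Step 1: one gradient per parameter, computed from a batch.

    Assumed model: y_hat = sum(w_i * x_i); assumed loss: mean squared error.
    """
    n = len(batch)
    grads = [0.0] * len(params)
    for x, y in batch:
        err = sum(w * xi for w, xi in zip(params, x)) - y
        for i, xi in enumerate(x):
            grads[i] += 2.0 * err * xi / n
    return grads


def select_gradients(grads: Sequence[float],
                     criterion: Callable[[float], bool]) -> List[int]:
    """Step 2: evaluate each gradient against the criterion; keep passing indices."""
    return [i for i, g in enumerate(grads) if criterion(g)]


def sparse_update(params: Sequence[float], grads: Sequence[float],
                  selected: Sequence[int], lr: float) -> List[float]:
    """Step 3: update only the parameters whose gradients were selected."""
    updated = list(params)
    for i in selected:
        updated[i] -= lr * grads[i]
    return updated


def train_step(params: Sequence[float], batch: Sequence[Example],
               criterion: Callable[[float], bool],
               lr: float = 0.1) -> Tuple[List[float], List[int]]:
    """Run the three steps once and return (new parameters, selected indices)."""
    grads = compute_gradients(params, batch)
    selected = select_gradients(grads, criterion)
    return sparse_update(params, grads, selected, lr), selected


if __name__ == "__main__":
    # Hypothetical data: the second feature is always zero, the third is small.
    batch = [([1.0, 0.0, 0.1], 1.0), ([1.0, 0.0, 0.1], 1.0)]
    # Assumed criterion: keep gradients whose magnitude is at least 0.5.
    new_params, selected = train_step([0.0, 0.0, 0.0], batch,
                                      criterion=lambda g: abs(g) >= 0.5)
    print(selected, new_params)
```

In this sketch only the first parameter passes the assumed threshold, so the other two are left unchanged by the step. Swapping in a different `criterion` callable (for example, a sign test or a per-layer threshold) changes which subset is updated without touching the rest of the code.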