CPC G06N 3/08 (2013.01) [G06F 17/15 (2013.01); G06F 18/24 (2023.01); G06N 3/063 (2013.01); G06N 3/082 (2013.01)] | 20 Claims |
1. A computer-implemented method comprising:
computing, from a dataset, a first set of features that have a first dimension; and
generating a second set of features from the first set of features using a neural network having a set of neural network parameters and a compression method that compresses the first set of features by entropy coding the first set of features using a probability distribution jointly learned with the set of neural network parameters,
wherein the second set of features has a second dimension that is smaller than the first dimension and wherein jointly learning the probability distribution with the set of neural network parameters maximizes a prediction accuracy of the neural network on a prediction task while minimizing a bit rate of the second set of features.
|