CPC G06N 3/088 (2013.01) [G06F 18/24 (2023.01); G06N 3/04 (2013.01); G06N 3/044 (2023.01); G06N 3/045 (2023.01); G06N 3/047 (2023.01); G06N 3/063 (2013.01); G06N 3/084 (2013.01); G06N 7/01 (2023.01); G06N 20/00 (2019.01); G06F 12/0815 (2013.01); G06F 17/18 (2013.01); G06N 3/048 (2023.01)] | 36 Claims |
1. A method for improving performance of a classifier, wherein the classifier comprises a deep neural network that is trained iteratively through machine learning, wherein the deep neural network of the classifier comprises an input layer, an output layer, and one or more inner layers, wherein the layers comprise interconnected nodes that process and transform input data through weighted connections, the method comprising:
computing, by the classifier implemented by a computer system, first and second scores for each of multiple data examples from a generator, wherein the first score is indicative of whether the data example belongs to a first data cluster and the second score is indicative of whether the data example belongs to a second data cluster, wherein the generator is trained with an objective such that, for each data example generated by the generator, the first and second scores computed by the classifier are equal;
back-propagating, by the computer system, for multiple data examples generated by the generator, partial derivatives from the classifier to obtain a vector, for each data example, that is orthogonal to a decision surface for the classifier;
detecting, by the computer system, a problem with the classifier based on changes in directions of the vectors; and
upon detecting a problem, adjusting, by the computer system, the classifier to reduce errors by the classifier caused by overfitting training data.
|