CPC G06N 20/00 (2019.01) [G06N 3/088 (2013.01)] | 20 Claims |
1. A method, comprising:
training, by a machine learning system comprising at least one processor and a memory, a classifier that, when applied to one or more data sets determines classifications of one or more labels for the one or more data sets, the training comprising:
labeling unlabeled data according to a specified amount of bias to generate labeled data comprising the specified amount of bias, wherein the specified amount of bias comprises one or more of a specified amount of label bias and a specified amount of selection bias in at least one dimension of a plurality of label dimensions;
generating a training data set comprising samples of the labeled data and additional unlabeled data; and
training the classifier according to the generated training data set and training parameters comprising an indication the specified amount of bias.
|