| CPC G06F 40/30 (2020.01) [G06F 40/284 (2020.01); G06F 40/40 (2020.01)] | 20 Claims |

|
1. A method for producing an embedding model, the method comprising:
receiving identification of a class;
gathering or generating a plurality of pairs of words or tokens associated with the class;
calculating a bias direction of the class in an embedding space, wherein calculating the bias direction includes:
obtaining a corpus of token pairs related to the class;
calculating token embeddings for each token of the token pairs of the corpus using the embedding model;
performing principal component analysis on the embeddings such that variance associated with the class comes to lie on a first principal component that expresses a biased portion of a meaning of the tokens; and
determining the bias direction using the first principal component;
for respective tokens in a series of tokens, calculating a protected gradient score with respect to the bias direction;
aggregating protected gradient scores of the respective tokens to form a series-level protected gradient score for the series of tokens;
determining a fairness indicator based on the protected gradient score for the series; and
responsive to the fairness indicator being above a threshold, permitting deployment of the embedding model.
|
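The steps of claim 1 can be illustrated with a minimal sketch, offered under stated assumptions rather than as the claimed implementation. The claim does not define how the protected gradient score is computed, so the sketch substitutes a simple projection-based stand-in (one minus the absolute cosine between a token embedding and the bias direction), which is not a gradient. It also assumes that each token pair is centered on its own midpoint before principal component analysis, that aggregation is a plain mean, and that the series-level score itself serves as the fairness indicator. All function names and the threshold value are hypothetical.

```python
import numpy as np


def bias_direction(pair_embeddings):
    """Estimate a bias direction from embeddings of token pairs.

    pair_embeddings: array-like of shape (n_pairs, 2, dim), holding the
    embedding of each token in each class-related pair.
    """
    pairs = np.asarray(pair_embeddings, dtype=float)
    # Assumption: center each pair on its midpoint so that only the
    # within-pair (class-related) variation remains.
    centered = pairs - pairs.mean(axis=1, keepdims=True)
    flat = centered.reshape(-1, pairs.shape[-1])
    # PCA via SVD: the first right singular vector is the first
    # principal component of the centered embeddings.
    _, _, vt = np.linalg.svd(flat, full_matrices=False)
    return vt[0] / np.linalg.norm(vt[0])


def token_score(embedding, direction):
    """Hypothetical stand-in for the per-token protected gradient score.

    Returns 1 - |cos(embedding, direction)|, so 1.0 means the token is
    orthogonal to the bias direction and 0.0 means it lies along it.
    """
    e = np.asarray(embedding, dtype=float)
    norm = np.linalg.norm(e)
    if norm == 0.0:
        return 1.0  # a zero vector carries no component along any direction
    return 1.0 - abs(float(e @ direction)) / norm


def series_score(embeddings, direction):
    """Aggregate per-token scores into a series-level score (mean, by assumption)."""
    return float(np.mean([token_score(e, direction) for e in embeddings]))


def permit_deployment(embeddings, direction, threshold):
    """Gate deployment: permit only if the fairness indicator exceeds the threshold."""
    return series_score(embeddings, direction) > threshold
```

Under this convention a higher score indicates less alignment with the bias direction, which matches the claim's condition that deployment is permitted when the fairness indicator is above the threshold. A different score definition or aggregation rule would change only `token_score` and `series_score`.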