| CPC G06F 40/295 (2020.01) [G06F 40/274 (2020.01)] | 19 Claims |

|
1. A method comprising:
receiving, by a processing device, input data describing a sequence of words ending with a last word;
predicting, by the processing device, a next word after the last word in the sequence of words by processing the input data using a machine learning model trained on injected Gaussian noise and training data to update parameters of the machine learning model to predict next words after last words in sequences of words, the training data describing a corpus of text associated with clients and including sensitive samples and non-sensitive samples taken from databases that are client-content adjacent as differing in that a client and a sensitive entity are present in one of the client-content adjacent databases and are not present in another one of the client-content adjacent databases; and
generating, by the processing device, an indication of the next word after the last word in the sequence of words for display in a user interface.
|