| CPC G06F 21/577 (2013.01) [G06F 21/31 (2013.01); G06N 3/04 (2013.01); G06N 3/08 (2013.01); G06F 2221/033 (2013.01)] | 20 Claims |

|
1. A system, comprising:
memory to store at least a portion of a neural network; and
one or more processors coupled with the memory to perform operations to:
train a neural network on a corpus of content, wherein the training results in word embeddings for words in the corpus, wherein each of the word embeddings is a numeric vector in a vector or matrix space;
identify an initial word of interest;
locate a vector that encodes the initial word of interest in the vector or matrix space;
identify vectors in the vector or matrix space that lie in a specified proximity to the vector for the initial word of interest and identifying words encoded by the identified vectors as additional words of interest, wherein the identifying comprises identifying ones of the vectors in the vector or matrix space that have distances or cosine values within a specified range as being in the specified proximity;
perform a security scan of a set of input to identify instances of the initial word of interest and instances of the additional words of interest in the input; and
generate output that specifies the identified instances of the initial word of interest in the set of input and that specifies that the instances of the additional words of interest in the set of input may be of interest.
|