| CPC G06F 21/566 (2013.01) [G06F 21/52 (2013.01); G06N 3/04 (2013.01); G06N 3/08 (2013.01); G06F 2221/033 (2013.01)] | 14 Claims |

|
1. A method for identifying malicious software on a user device, the method comprising:
receiving, at data processing hardware, an application install pattern from a user device, the application install pattern indicating a sequence of n-applications installed on the user device;
for each application in the sequence of n-applications:
generating, by the data processing hardware, a numerical vector representation for the corresponding application using a feed-forward neural network model configured to receive each application and the order of each application in the sequence of n-applications as feature inputs; and
clustering, by the data processing hardware, the corresponding application in a free vector space based on the numerical vector representation for the corresponding application;
determining, by the data processing hardware, whether any of the applications in the sequence of n-applications are clustered with training applications identified as malware; and
for each application clustered with training applications identified as malware, identifying, by the data processing hardware, the corresponding application in the sequence of n-applications as malware.
|