US 11,973,768 B2
	Method and system for detecting malicious payloads
Nicolas Beauchesne, Miami Beach, FL (US); and John Steven Mancini, San Jose, CA (US)
Assigned to Vectra AI, Inc., San Jose, CA (US)
Filed by Vectra AI, Inc., San Jose, CA (US)
Filed on Nov. 24, 2020, as Appl. No. 17/103,882.
Application 17/103,882 is a continuation of application No. 15/702,507, filed on Sep. 12, 2017, abandoned.
Claims priority of provisional application 62/393,596, filed on Sep. 12, 2016.
Prior Publication US 2021/0105290 A1, Apr. 8, 2021
Int. Cl. G06F 21/56 (2013.01); G06N 5/022 (2023.01); H04L 9/40 (2022.01); H04W 12/12 (2021.01); G06N 20/00 (2019.01)

CPC H04L 63/1416 (2013.01) [G06F 21/56 (2013.01); G06N 5/022 (2013.01); H04W 12/12 (2013.01); G06N 20/00 (2019.01); H04L 63/0245 (2013.01); H04L 63/1425 (2013.01)]

20 Claims

1. A system for implementing machine learning using communications over respective unique server and port combinations within a network environment, the system comprising:

a metadata processing element that extracts metadata from network traffic for client-server sessions;

a learning module generating and storing a plurality of behavior models for respective unique combinations of a server and a port from the client-server sessions, a respective behavior model of the plurality of behavior models being generated during a training period by:

storing a first baseline pattern for a first communication of a respective unique combination in the respective behavior model corresponding to the respective unique combination,

comparing one or more additional baseline patterns for one or more additional communications to any existing baseline patterns for the respective unique combination to generate a similarity score, and

storing respective additional baseline patterns for the one or more additional communications of a respective unique combination in the respective behavior model corresponding to the respective unique combination based on the similarity score; and

an update module updating the plurality of behavior models for the respective unique combinations of the server and the port from the client-server sessions, the respective behavior model of the plurality of behavior models being updated outside of the training period by:

comparing one or more new baseline patterns for one or more new communications to any existing baseline patterns for the respective unique combination to generate a similarity score, and

storing respective new baseline patterns for the one or more new communications of a respective unique combination in the respective behavior model corresponding to the respective unique combination based on the similarity score and a threshold number of occurrences, wherein the similarity score is calculated using a hamming distance between two baseline patterns.