US 12,244,617 B2
Machine learning based anomaly detection initialization
Jeevan Tambuluri, Santa Clara, CA (US); Ravi Ithal, Los Altos, CA (US); Steve Malmskog, San Jose, CA (US); Abhay Kulkarni, Cupertino, CA (US); Ariel Faigon, Santa Clara, CA (US); and Krishna Narayanaswamy, Saratoga, CA (US)
Assigned to Netskope, Inc., Santa Clara, CA (US)
Filed by Netskope, Inc., Santa Clara, CA (US)
Filed on Jul. 5, 2023, as Appl. No. 18/347,498.
Application 18/347,498 is a continuation of application No. 17/332,879, filed on May 27, 2021, granted, now 11,743,275.
Application 17/332,879 is a continuation of application No. 16/389,861, filed on Apr. 19, 2019, granted, now 11,025,653, issued on Jun. 1, 2021.
Application 16/389,861 is a continuation of application No. 15/256,483, filed on Sep. 2, 2016, granted, now 10,270,788, issued on Apr. 23, 2019.
Claims priority of provisional application 62/346,382, filed on Jun. 6, 2016.
Prior Publication US 2023/0344841 A1, Oct. 26, 2023
Int. Cl. H04L 9/40 (2022.01); G06F 21/55 (2013.01); G06F 21/62 (2013.01); G06N 5/02 (2023.01); G06N 7/01 (2023.01); G06N 20/00 (2019.01)
CPC H04L 63/1416 (2013.01) [G06F 21/554 (2013.01); G06F 21/6209 (2013.01); G06N 5/02 (2013.01); G06N 7/01 (2023.01); G06N 20/00 (2019.01)] 20 Claims
OG exemplary drawing
 
1. A method of initializing an anomaly detector that handles a stream of security-related events of one or more organizations, the method comprising:
feeding, to an online machine learner, the stream of security-related events, each security-related event comprising a space identifier (ID) of a plurality of space IDs and one or more feature-value pairs;
transforming the security-related events, the transforming comprising:
assigning the one or more feature-value pairs of each security-related event in the stream of security-related events into a plurality of categorical bins, and
coding the assigned feature-value pairs with a Boolean value representing the feature-value pair associated with the respective categorical bins of the plurality of categorical bins;
analyzing the stream of transformed security-related events using a loss function analyzer of the online machine learner, the analyzing comprising:
grouping transformed security-related events in the stream of transformed security-related events into sub-streams by the space ID of the respective transformed security-related event, and
separately analyzing each sub-stream with the loss function analyzer, the analyzing comprising:
correlating the coded feature-value pairs of the sub-stream with a target feature artificially labeled as a constant to generate a probability prediction for each of the coded feature-value pairs; and
storing the probability predictions for each of the coded feature-value pairs associated with the space ID associated with the respective sub-stream; and
initializing the anomaly detector using the probability predictions.