US 12,452,259 B2
	Web threat investigation using advanced web crawling
John Morgan, San Diego, CA (US); Rob Tietje, San Diego, CA (US); David Wang, San Diego, CA (US); Tejas Pattabhi, San Diego, CA (US); and Josh Biol, Broomfield, CO (US)
Assigned to OPEN TEXT INC., Menlo Park, CA (US)
Filed by Open Text Inc., Menlo Park, CA (US)
Filed on Dec. 13, 2021, as Appl. No. 17/549,313.
Application 17/549,313 is a continuation of application No. 16/021,630, filed on Jun. 28, 2018, granted, now 11,201,875.
Prior Publication US 2022/0217160 A1, Jul. 7, 2022
This patent is subject to a terminal disclaimer.
Int. Cl. G06F 21/56 (2013.01); G06F 16/951 (2019.01); H04L 9/40 (2022.01); H04L 29/06 (2006.01); G06F 21/60 (2013.01)

CPC H04L 63/1416 (2013.01) [G06F 16/951 (2019.01); H04L 63/1425 (2013.01)]

18 Claims

1. A computer-implemented method comprising:

receiving a seed resource identifier;

determining a related resource identifier associated with the seed resource identifier;

evaluating the related resource identifier to determine a classification of the related resource identifier, evaluating the related resource identifier comprising:

determining a third resource identifier associated with the related resource identifier, wherein determining the third resource identifier comprises scanning a related resource corresponding to the related resource identifier to determine a resource made available via the related resource identifier in a webpage corresponding to the related resource identifier; and

processing the third resource identifier with a machine learning model to classify the third resource identifier according to a classification representing a predicted level of threat, the machine learning model trained to classify resource identifiers into a plurality of classifications, the plurality of classifications comprising:

a first category for safe resource identifiers; and

a plurality of additional categories, the plurality of additional categories representing different levels of threat;

classifying the related resource identifier based on a classification of the third resource identifier; and

executing a corrective action based on the classification of the related resource identifier, wherein executing the corrective action comprises modifying at least one of a permission or a privilege level.