US 12,231,390 B2
Domain name classification systems and methods
Sharon Huffner, Munich (DE); and Ali Mesdaq, San Jose, CA (US)
Assigned to Proofpoint, Inc., Sunnyvale, CA (US)
Filed by Proofpoint, Inc., Sunnyvale, CA (US)
Filed on Sep. 29, 2023, as Appl. No. 18/478,564.
Application 18/478,564 is a continuation of application No. 17/500,915, filed on Oct. 13, 2021, granted, now 11,799,823.
Application 17/500,915 is a continuation of application No. 16/866,297, filed on May 4, 2020, granted, now 11,171,916, issued on Nov. 9, 2021.
Application 16/866,297 is a continuation of application No. 15/687,660, filed on Aug. 28, 2017, granted, now 10,673,814, issued on Jun. 2, 2020.
Prior Publication US 2024/0039886 A1, Feb. 1, 2024
This patent is subject to a terminal disclaimer.
Int. Cl. H04L 61/30 (2022.01); H04L 9/40 (2022.01); H04L 61/4511 (2022.01)
CPC H04L 61/3005 (2013.01) [H04L 61/4511 (2022.05); H04L 63/1483 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A method, comprising:
obtaining, by a computer, a domain name, a seed value, and an identification of a substring of the domain name that is relevant match to the seed value;
determining, by the computer, key-value pairs that encode information about terms in substrings of the domain name, the terms including a term in the substring of the domain name, wherein the determining comprises:
obtaining a language model for the term in the substring of the domain name;
analyzing a cluster of language models closest to the language model for the term in the substring of the domain name, wherein the analyzing the cluster of language models comprises analyzing a plurality of language models within a predetermined threshold distance from the language model for the term in the substring of the domain name; and
determining, based on the analyzing, a relevance of the term in the substring of the domain name to the seed value; and
providing, by the computer, the relevance of the term in the substring of the domain name to the seed value to a computing device.