| CPC H04L 61/3005 (2013.01) [H04L 61/4511 (2022.05); H04L 63/1483 (2013.01)] | 20 Claims |

|
1. A method, comprising:
obtaining, by a computer, a domain name, a seed value, and an identification of a substring of the domain name that is relevant match to the seed value;
determining, by the computer, key-value pairs that encode information about terms in substrings of the domain name, the terms including a term in the substring of the domain name, wherein the determining comprises:
obtaining a language model for the term in the substring of the domain name;
analyzing a cluster of language models closest to the language model for the term in the substring of the domain name, wherein the analyzing the cluster of language models comprises analyzing a plurality of language models within a predetermined threshold distance from the language model for the term in the substring of the domain name; and
determining, based on the analyzing, a relevance of the term in the substring of the domain name to the seed value; and
providing, by the computer, the relevance of the term in the substring of the domain name to the seed value to a computing device.
|