US 12,346,362 B2
Mapping webpages to page groups
Slim Frikha, Paris (FR); and Michael Snellman, Paris (FR)
Assigned to Content Square SAS, Paris (FR)
Filed by Content Square SAS, Paris (FR)
Filed on Oct. 11, 2023, as Appl. No. 18/378,812.
Application 18/378,812 is a continuation of application No. 17/877,691, filed on Jul. 29, 2022, granted, now 11,841,891.
Claims priority of provisional application 63/336,780, filed on Apr. 29, 2022.
Prior Publication US 2024/0037130 A1, Feb. 1, 2024
This patent is subject to a terminal disclaimer.
Int. Cl. G06F 16/35 (2025.01); G06F 16/3332 (2025.01); G06F 16/955 (2019.01)
CPC G06F 16/35 (2019.01) [G06F 16/3334 (2019.01); G06F 16/955 (2019.01)] 20 Claims
OG exemplary drawing
 
1. A method, comprising:
receiving plural Uniform Resource Locators (URLs), each URL of the plural URLs corresponding to a respective webpage of a website;
accessing, from a database, a set of terms, the set of terms having been predetermined as prioritized;
extracting distinct terms corresponding to a path level, a query key and a cvar key for the plural URLs;
computing a similarity score of the distinct terms with the set of terms;
identifying, based on the computing, URLs of the plural URLs having at least one term appearing within the set of terms;
applying weights to the identified URLs, to prioritize the identified URLs relative to other URLs of the plural URLs;
performing, based on applying the weights, hierarchical clustering with respect to the plural URLs, to generate a dendrogram in which the plural URLs are arranged in hierarchical clusters;
storing a representation of the dendrogram;
automatically determining, based on the stored representation of the dendrogram, a predicted page group for each of the plural URLs; and
causing, based on determining the predicted page group for each of the plural URLs, display of metrics corresponding to the website.