CPC G06F 16/951 (2019.01) [G06F 16/285 (2019.01); G06F 16/335 (2019.01); G06F 16/353 (2019.01); G06F 16/9566 (2019.01)] | 19 Claims |
1. A system, comprising:
a processor configured to:
in response to receiving a website misclassification report comprising a user-reported indication that access to a first URL, having an associated first domain, is erroneously blocked, wherein the first URL was previously assigned a categorization associated with at least a first subject matter topic based on content analysis performed by an original classification model, use a single page classifier to perform a recrawl-reclassification operation on the first URL using a current classification model that is an updated version of the original classification model, to determine at least one current subject matter topic and assign a current categorization for the first URL; and
determine that there is at least one discrepancy among at least two of: the first categorization, the current categorization, or information included in the misclassification report, and take a remedial action in response to the determination, wherein the remedial action includes at least one of: (1) initiating an escalation event when the first categorization and the result of the recrawl-reclassification operation are in agreement, or (2) assigning the current categorization to the URL when the current categorization is different from the first categorization and initiating a recrawl-reclassification operation on at least one additional domain; and
a memory coupled to the processor and configured to provide the processor with instructions.
|
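The decision logic recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation. The names `MisclassificationReport`, `Outcome`, `RemedialAction`, `handle_report`, `reclassify`, and `additional_domains` are hypothetical. The sketch assumes that a categorization can be modeled as a set of topic strings, that the single page classifier running the current model is supplied as a callable, and that the additional domains to recrawl are chosen by a caller-supplied callable, since the claim does not say how they are selected.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Callable, FrozenSet, Iterable, List


class RemedialAction(Enum):
    ESCALATE = "escalate"
    REASSIGN = "reassign_and_recrawl_additional_domains"


@dataclass(frozen=True)
class MisclassificationReport:
    """User report that access to `url` (on `domain`) is erroneously blocked."""
    url: str
    domain: str


@dataclass
class Outcome:
    action: RemedialAction
    categorization: FrozenSet[str]
    domains_to_recrawl: List[str] = field(default_factory=list)


def handle_report(
    report: MisclassificationReport,
    first_categorization: Iterable[str],
    reclassify: Callable[[str], Iterable[str]],
    additional_domains: Callable[[str], Iterable[str]],
) -> Outcome:
    """Recrawl and reclassify the reported URL, then pick a remedial action.

    `reclassify` stands in for the single page classifier using the current
    model. `additional_domains` is an assumed hook that returns other domains
    to recrawl and reclassify.
    """
    first = frozenset(first_categorization)
    current = frozenset(reclassify(report.url))

    if current == first:
        # Both models agree with each other, yet the user says the block is
        # wrong: the discrepancy is with the report, so escalate.
        return Outcome(RemedialAction.ESCALATE, first)

    # The models disagree: adopt the current categorization and queue at
    # least one additional domain for recrawl-reclassification.
    return Outcome(
        RemedialAction.REASSIGN,
        current,
        list(additional_domains(report.domain)),
    )
```

In this sketch the function only returns the chosen action and the domains to queue. Carrying out the escalation or the additional recrawls is left to the caller.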