CPC G06Q 30/0277 (2013.01) [G06F 9/54 (2013.01); G06F 18/214 (2023.01); G06F 18/24 (2023.01); G06F 40/143 (2020.01); G06F 40/40 (2020.01); G06N 20/00 (2019.01); G06V 10/764 (2022.01); G06V 10/82 (2022.01)] | 20 Claims |
1. A computer system comprising:
memory; and
a processor in communication with the memory and configured with processor-executable instructions to perform operations comprising:
receiving a request to perform contextual analysis of a page, wherein the page is accessible over a network via a uniform resource identifier (URI);
sending a network request for the page via the URI;
receiving an initial file in response to the network request;
initiating execution of a headless content extractor, wherein the headless content extractor is configured to extract content included in or referenced in the initial file without causing display of the content, wherein at least a portion of the content extracted is not included in the initial file accessible via the URI but is requested from an external system over the network based at least in part on execution by the headless content extractor of code included in the initial file;
performing contextual analysis of the content extracted by the headless content extractor, including contextual analysis of the portion of the content not included in the initial file; and
storing, in an electronic data store, keywords associated with the page determined from the results of the contextual analysis, wherein storing the keywords includes associating the keywords with the URI in the electronic data store.
|
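The data flow recited in claim 1 (request the page, receive the initial file, extract content including content loaded from an external system, analyze it, and store keywords against the URI) can be illustrated with a minimal sketch. This is not the patented implementation. Every name here (`extract_content`, `extract_keywords`, `analyze_page`, `PAGES`, the `data-src` attribute convention, and the `example.test` URIs) is a hypothetical chosen for illustration. In particular, the sketch does not execute any code from the initial file: a real headless content extractor would typically run the page's scripts in a headless browser, whereas this stand-in only follows `data-src` references to simulate content that such scripts would request. Contextual analysis is reduced to word-frequency counting, and a plain dictionary stands in for the electronic data store.

```python
import re
from collections import Counter
from typing import Callable, Dict, List

# Small illustrative stopword list (assumption; a real analyzer would use more).
STOPWORDS = {"the", "and", "for", "with", "that", "this"}


def extract_content(initial_file: str, fetch: Callable[[str], str]) -> str:
    """Stand-in for a headless content extractor.

    Returns the text found in the initial file plus text fetched from
    external URIs referenced by hypothetical data-src attributes.
    Nothing is rendered or displayed.
    """
    external_uris = re.findall(r'data-src="([^"]+)"', initial_file)
    inline_text = re.sub(r"<[^>]+>", " ", initial_file)  # drop markup
    external_text = " ".join(fetch(uri) for uri in external_uris)
    return f"{inline_text} {external_text}"


def extract_keywords(text: str, top_n: int) -> List[str]:
    """Toy contextual analysis: the most frequent non-stopword terms."""
    words = re.findall(r"[a-z]{3,}", text.lower())
    counts = Counter(w for w in words if w not in STOPWORDS)
    return [word for word, _ in counts.most_common(top_n)]


def analyze_page(
    uri: str,
    fetch: Callable[[str], str],
    store: Dict[str, List[str]],
    top_n: int = 2,
) -> List[str]:
    """Fetch the page, extract content, analyze it, and store keywords by URI."""
    initial_file = fetch(uri)                       # network request + response
    content = extract_content(initial_file, fetch)  # includes external content
    store[uri] = extract_keywords(content, top_n)   # associate keywords with URI
    return store[uri]


# In-memory stand-in for the network (hypothetical pages, no real requests).
PAGES = {
    "https://example.test/page": (
        "<html><body><h1>Hiking boots</h1>"
        '<div data-src="https://example.test/reviews"></div></body></html>'
    ),
    "https://example.test/reviews": (
        "Boots review: waterproof boots for hiking trails"
    ),
}

if __name__ == "__main__":
    data_store: Dict[str, List[str]] = {}
    print(analyze_page("https://example.test/page", PAGES.__getitem__, data_store))
```

Passing `fetch` as a parameter keeps the sketch testable without network access. Swapping in a real HTTP client and a real headless browser would change only `fetch` and `extract_content`, leaving the analysis and storage steps untouched.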