US 11,941,072 B1
Generating a proactive alert for outdated scraping script
Itay Margolin, Tel Aviv (IL); Aleksandr Kim, Tel Aviv (IL); and Yair Horesh, Tel Aviv (IL)
Assigned to INTUIT INC., Mountain View, CA (US)
Filed by INTUIT INC., Mountain View, CA (US)
Filed on Jun. 29, 2023, as Appl. No. 18/344,799.
Int. Cl. G06F 16/951 (2019.01)
CPC G06F 16/951 (2019.01) 20 Claims
OG exemplary drawing
 
1. A method performed by a processor, said method comprising:
generating a sample comprising a predetermined number of webpages targeted by a scraping script;
appending the scraping script to each of the webpages in the sample to generate corresponding predetermined number of documents comprising the webpages appended with the scraping script;
identifying common text fragments across the documents;
generating a structured list of text fragments based on the identified common text fragments; and
outputting an alert that the scraping script needs an update responsive to determining that the structured list of text fragments does not match an existing list of text fragments.