| CPC G06F 16/951 (2019.01) [G06F 16/955 (2019.01); H04L 63/0236 (2013.01)] | 19 Claims |

|
1. A method for training isolation forests, comprising:
acquiring a plurality of categories by classifying uniform resource identifiers;
acquiring sample behavior data by monitoring a behavior of a client from each of Internet Protocol IP addresses in a plurality of IP addresses accessing the uniform resource identifiers under the plurality of categories;
encoding the sample behavior data as a sample access vector; and
training, based on the sample access vector, an isolation forest for recognizing a web crawler from the client;
wherein encoding the sample behavior data as the sample access vector comprises:
counting a quantity of uniform resource identifiers under each of the categories accessed by the client from each of the IP addresses in the sample behavior data; and
acquiring the sample access vector of each of the IP addresses by respectively setting, with each of the categories as a dimension of the vector, a plurality of quantities corresponding to the plurality of categories as values of a plurality of dimensions corresponding to the plurality of categories in the vector.
|