| CPC G06F 21/62 (2013.01) [G06F 16/9577 (2019.01); G06F 16/958 (2019.01)] | 18 Claims |

|
1. A page processing method, comprising:
determining a plurality of layout object nodes of a page, according to an obtained Hypertext Markup Language (HTML) file;
filtering the plurality of layout object nodes according to a preset recall rule to obtain a layout object node satisfying the recall rule, after laying out the plurality of layout object nodes of the page;
predicting whether the layout object node satisfying the recall rule is a designated target node; and
shielding the designated target node, and generating a shielded page based on remaining layout target nodes after the shielding,
wherein predicting whether the layout object node satisfying the recall rule is the designated target node comprises:
calculating a node characteristic of the layout object node satisfying the recall rule, according to attribute information of the layout object node satisfying the recall rule, wherein the node characteristic is a feature having specified number of dimensions extracted and calculated from the attribute information of the layout object node, the specified number of dimensions is greater than or equal to 10;
processing the node characteristic by using a preset node prediction model, to obtain a probability value of the layout object node satisfying the recall rule being the designated target node; and
determining whether the layout object node satisfying the recall rule is the designated target node according to the probability value.
|