| CPC G06F 16/906 (2019.01) | 20 Claims |

|
1. A method comprising:
identifying, by a central database system for each of a plurality of historical users, an action being performed by the historical user within a domain and a set of consecutive web pages viewed by the historical user while performing the action, each of the consecutive web pages corresponding to a web address and each web address corresponding to a web page of the consecutive web pages within a threshold similarity of the web address corresponding to each other web page of the consecutive web pages;
generating, by the central database system, a training data set comprising, for each of the plurality of historical users, the identified action and the set of web pages viewed while the action is being performed;
training, by the central database system, a cluster model using the training data set, the cluster model configured to predict a desired action to be performed by an acting user based on web pages viewed by the acting user;
applying, by the central database system, the cluster model to web pages viewed by a target user and a predicted next web page to be viewed by the target user to identify an action being performed by the target user; and
retraining the cluster model in response to determining that the target user is performing a new action different from actions performed by the plurality of historical users.
|