| CPC G06N 7/01 (2023.01) [G06F 16/9024 (2019.01); G06F 18/2163 (2023.01); G06F 18/217 (2023.01); G06N 20/20 (2019.01); G06Q 10/0635 (2013.01); G06Q 40/08 (2013.01); G06V 10/751 (2022.01)] | 10 Claims |

|
1. A computer-implemented method, comprising:
receiving, by at least one processor, telematics data and insurance claims data for a population of drivers;
generating, by the at least one processor, a training dataset based on the telematics data, the training dataset including:
values for a proxy variable derived from the telematics data, and
values for one or more features derived from the telematics data for predicting the proxy variable;
generating, by the at least one processor, a testing dataset based on the telematics data and the claims data, the testing dataset including:
values for a target variable derived from the claims data, and
values for the one or more features derived from the telematics data;
generating, by the at least one processor, a statistical model using the training dataset, the statistical model configured to predict values of the proxy variable from values of the one or more features; and
validating, by the at least one processor, the statistical model using the testing dataset.
|
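The flow recited in claim 1 (fit on a telematics-derived proxy variable, then validate against a claims-derived target) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the claim: the synthetic driver population, the single feature `night_fraction_score`, the proxy `harsh_event_rate`, the binary target `filed_claim`, the one-feature least-squares model, the held-out split, and the use of AUC as the validation metric. The claim itself does not specify any of these choices.

```python
import random
from statistics import mean

def fit_linear(xs, ys):
    """Ordinary least squares with one feature: y is approximated by a + b*x."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    b = sxy / sxx
    return my - b * mx, b

def auc(scores, labels):
    """Fraction of (positive, negative) pairs ranked correctly; ties count half."""
    pos = [s for s, l in zip(scores, labels) if l]
    neg = [s for s, l in zip(scores, labels) if not l]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Hypothetical synthetic population standing in for the received data.
rng = random.Random(0)
drivers = []
for _ in range(2000):
    risk = rng.random()  # latent riskiness, never observed by the model
    drivers.append({
        # Telematics-derived feature (assumed name and scale).
        "night_fraction_score": 10 * risk + rng.gauss(0, 1),
        # Telematics-derived proxy variable (assumed name and scale).
        "harsh_event_rate": 5 * risk + rng.gauss(0, 1),
        # Claims-derived target variable (assumed binary outcome).
        "filed_claim": rng.random() < 0.02 + 0.6 * risk,
    })

# Training dataset: feature and proxy values, telematics only.
train = drivers[:1000]
train_x = [d["night_fraction_score"] for d in train]
train_y = [d["harsh_event_rate"] for d in train]

# Testing dataset: the same feature plus the claims-derived target.
# Using held-out drivers is an assumption of this sketch.
test = drivers[1000:]
test_x = [d["night_fraction_score"] for d in test]
test_y = [d["filed_claim"] for d in test]

# Statistical model: predicts the proxy variable from the feature.
intercept, slope = fit_linear(train_x, train_y)

# Validation: check that predicted proxy values rank drivers by the target.
predicted_proxy = [intercept + slope * x for x in test_x]
validation_auc = auc(predicted_proxy, test_y)
print(f"slope={slope:.3f}, validation AUC={validation_auc:.3f}")
```

The point of the sketch is the separation of roles: the model never sees claims data during fitting, and the claims-derived target is used only to check whether predictions of the proxy carry information about actual claim outcomes.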