CPC G06N 20/20 (2019.01) [G06N 20/00 (2019.01); G16H 50/20 (2018.01); G16H 50/50 (2018.01); G16H 70/60 (2018.01)] | 24 Claims |
1. A method for predictive disease identification, comprising:
applying a plurality of first machine learning models based on training data in order to generate a first set of outputs, wherein the plurality of first machine learning models includes an ensemble of boosting machine learning models and a logistic regression model, wherein the ensemble of boosting machine learning models is sequentially trained using a boosting algorithm in a sequence, wherein misclassifications by a model among the ensemble of boosting machine learning models in the sequence are used to adjust weights of subsequent models among the ensemble of boosting machine learning models in the sequence;
training a second machine learning model based on the first set of outputs, wherein the second machine learning model is a combiner model trained to output a plurality of disease predictor values based on the first set of outputs, wherein each disease predictor value corresponds to a respective disease type of a plurality of disease types, wherein each disease type of the plurality of disease types corresponds to a predetermined group of diseases;
applying the plurality of first machine learning models and the second machine learning model based on features extracted from data including animal characteristics data of an animal, wherein outputs of the plurality of first machine learning models and the second machine learning model include a plurality of disease predictor values, wherein each disease predictor value corresponds to a respective disease type of a plurality of disease types, wherein each disease type of the plurality of disease types corresponds to a predetermined group of diseases;
determining a plurality of simulation parameters based on the plurality of disease predictor values, wherein the plurality of simulation parameters define at least a plurality of time periods for which a plurality of disease contraction simulations are to be run;
running the plurality of disease contraction simulations based on the plurality of simulation parameters in order to obtain simulation results, wherein running the plurality of disease contraction simulations includes providing the plurality of disease predictor values to a simulation engine configured to determine predictions of diseases for animals, wherein the plurality of disease contraction simulations are run for the plurality of time periods defined in the plurality of simulation parameters;
generating disease contraction statistics based on the simulation results; and
determining, based on the disease contraction statistics, at least one disease prediction for the animal.
|