US 11,734,609 B1
	Customized predictive analytical model training
Jordan M. Breckenridge, Menlo Park, CA (US); Travis H. K. Green, New York, NY (US); Robert Kaplow, New York, NY (US); Wei-Hao Lin, New York, NY (US); and Gideon S. Mann, New York, NY (US)
Assigned to Google LLC, Mountain View, CA (US)
Filed by Google LLC, Mountain View, CA (US)
Filed on Jun. 17, 2021, as Appl. No. 17/350,991.
Application 17/350,991 is a continuation of application No. 15/155,343, filed on May 16, 2016, granted, now 11,042,809.
Application 15/155,343 is a continuation of application No. 14/295,563, filed on Jun. 4, 2014, granted, now 9,342,798, issued on May 17, 2016.
Application 14/295,563 is a continuation of application No. 13/170,067, filed on Jun. 27, 2011, granted, now 8,762,299, issued on Jun. 24, 2014.
This patent is subject to a terminal disclaimer.
Int. Cl. G06N 20/00 (2019.01); G06N 5/02 (2023.01); H04L 67/02 (2022.01); G06F 18/21 (2023.01); G06F 18/20 (2023.01); G06N 7/01 (2023.01)

CPC G06N 20/00 (2019.01) [G06F 18/217 (2023.01); G06F 18/285 (2023.01); G06N 5/02 (2013.01); G06N 7/01 (2023.01); H04L 67/02 (2013.01)]

15 Claims

1. A computer-implemented method comprising:

receiving a plurality of first training data records of a first data type, wherein each first training data record includes an input data portion and an output data portion;

determining a first training data type that corresponds to the first training data records, comprising:

parsing each first training data record;

comparing the output data portions of the first training data records to a plurality of data formats;

based on the comparison, determining a match to a particular data format of the plurality of data formats and determining the first training data type based on the particular data format;

based on the determined first training data type, estimating, for each predictive model of a plurality of predictive models, an effectiveness the predictive model;

based on the estimation, selecting a proper subset of the predictive models;

training each predictive model of the proper subset of predictive models based on, at least, the first training data records;

scoring each predictive model in the proper subset of predictive models, each score for each model representing an estimation of the effectiveness of the respective trained predictive model for data records of the first data type; and

selecting, as a first selected predictive model for use with the first data type, a trained predictive model having a highest score from the proper subset of predictive models.