CPC G06Q 30/0278 (2013.01) [G06Q 30/02 (2013.01); G06Q 50/16 (2013.01)] | 20 Claims |
1. A computer-implemented method, in a computing system having a memory and a processor, for generating machine learning models to value homes located in a distinguished geographic area, comprising:
retrieving, by the processor, home sales data for the distinguished geographic area, the home sales data comprising multiple entries, each entry indicating a selling price and a value for one or more home attributes;
creating, by the processor, one or more machine learning classification trees by:
for each distinguished classification tree of the one or more classification trees:
selecting a subset of the multiple entries;
selecting a subset of the one or more home attributes;
for each of the selected home attributes, determining a range of values of the selected attribute among the selected entries;
establishing a root node in the distinguished classification tree representing the range of values of each of the selected attributes; and
for each distinguished node of the tree that has not been identified as a leaf node, determining a greatest information gain resulting from one or more possible splits in the ranges of values represented by the distinguished node;
when the greatest information gain exceeds an information gain identified for the distinguished node, establishing, for each of two subranges corresponding to the split with the greatest information gain, a child node of the distinguished node; and
when the greatest information gain does not exceed the information gain identified for the distinguished node, identifying the distinguished node as a leaf node and calculating a mean selling price for homes represented by the leaf node.
|