| CPC G06F 16/906 (2019.01) [G06F 16/215 (2019.01); G06F 16/2246 (2019.01); G06F 16/258 (2019.01); G06F 16/287 (2019.01); G06F 18/22 (2023.01); G06N 3/045 (2023.01); G06N 20/20 (2019.01)] | 20 Claims |

|
1. A computer-implemented method comprising:
extracting, from a first tree node of a first data tree comprising a first plurality of interconnected nodes representing relationships among a first set of tree persons, a first set of features using a feature extractor;
extracting, from a second tree node of a second data tree comprising a second plurality of interconnected nodes representing relationships among a second set of tree persons, a second set of features using the feature extractor;
identifying, within a cluster database storing clusters of tree nodes and from the first set of features and the second set of features, a cluster comprising a set of tree nodes including tree nodes from the first data tree and the second data tree, wherein the set of tree nodes corresponds to a single tree person by:
generating an individual-level similarity score between the first tree node and the second tree node;
generating additional individual-level similarity scores between additional nodes of the first plurality of interconnected nodes within the first data tree and additional nodes of the second plurality of interconnected nodes within the first data tree and additional nodes of the second plurality of interconnected nodes within the second data tree; and
determining the cluster based on the individual-level similarity score and the additional individual-level similarity scores; and
based on identifying the cluster comprising the set of tree nodes corresponding to the single tree person, modifying the cluster database to include the cluster representing the single tree person.
|