US 12,332,902 B2
Filtering individual datasets in a database
Milos Pavlovic, Sandy, UT (US); and Ross Eugene Curtis, Cedar Hills, UT (US)
Assigned to Ancestry.com DNA, LLC, Lehi, UT (US)
Filed by Ancestry.com DNA, LLC, Lehi, UT (US)
Filed on Apr. 20, 2023, as Appl. No. 18/137,232.
Claims priority of provisional application 63/332,811, filed on Apr. 20, 2022.
Prior Publication US 2023/0342364 A1, Oct. 26, 2023
Int. Cl. G06F 16/24 (2019.01); G06F 16/2457 (2019.01); G06N 3/0464 (2023.01); G06N 3/08 (2023.01)
CPC G06F 16/2457 (2019.01) [G06N 3/0464 (2023.01); G06N 3/08 (2013.01)] 18 Claims
OG exemplary drawing
 
1. A computer-implemented method, comprising:
receiving a target individual genetic dataset associated with a target individual;
identifying a plurality of matched individuals who genetically match with the target individual;
identifying a plurality of potential ancestors who are potential common ancestors between the target individual and one of the matched individuals;
inputting a set of features related to the target individual to a machine learning model, wherein training of the machine learning model comprises:
generating a plurality of training samples, wherein generating one of the plurality of training samples comprises:
identifying a training target individual, the training target individual having a genetic dataset,
identifying ancestors of the training target individual from one or more family trees of the training target individual,
determining, based on existing family trees that includes the training target individuals, whether each of the ancestors is a direct-line ancestor of the training target individual, and
assigning a positive label to a particular ancestor responsive to the particular ancestor being a direct-line ancestor of the training target individual; and
training the machine learning model using the plurality of training samples;
filtering the plurality of potential common ancestors using the machine learning model to identify a subset of the potential common ancestors of the target individual.