US 12,326,894 B2
Systems and methods for determining ethnicity subregions
Alisa Elnaz Sedghifar, San Francisco, CA (US); Andre Everson Kim, Upland, CA (US); Ju Zhang, San Jose, CA (US); Ross Eugene Curtis, Cedar Hills, UT (US); Natalie Anne Swinford, Saratoga Springs, UT (US); Jeffrey Adrion, Salt Lake City, UT (US); and Yong Wang, San Mateo, CA (US)
Assigned to Ancestry.com DNA, LLC, Lehi, UT (US)
Filed by Ancestry.com DNA, LLC, Lehi, UT (US)
Filed on Jun. 6, 2024, as Appl. No. 18/736,429.
Claims priority of provisional application 63/506,722, filed on Jun. 7, 2023.
Prior Publication US 2024/0411793 A1, Dec. 12, 2024
Int. Cl. G06F 7/00 (2006.01); G06F 16/35 (2019.01)
CPC G06F 16/35 (2019.01) 20 Claims
OG exemplary drawing
 
1. A computer-implemented method, comprising:
receiving an inheritance dataset of a target named entity;
accessing a plurality of clusters that are associated with a region, each cluster comprising inheritance data for a plurality of reference panel named entities;
determining that the inheritance dataset of the target named entity has at least a threshold amount of data strings that are classified to the region;
comparing, for each cluster, the inheritance dataset of the target named entity to the reference panel named entities in the cluster to identify shared data string segments between the target named entity and the reference panel named entities;
determining, for each cluster, a metric based on the data string segments shared between the target named entity and the reference panel named entities included in the cluster;
comparing, for each cluster, the metric to a threshold specific to the cluster; and
assigning the target named entity to one or more data origins based on the comparison between the metric and the threshold specific to each cluster.