US 12,332,974 B2
Determination of data-source influence on data manifestations
Andre Everson Kim, Upland, CA (US); Alisa Elnaz Sedghifar, San Francisco, CA (US); Ross Eugene Curtis, Cedar Hills, UT (US); and Caitlyn Elizabeth Bruns, Saratoga Springs, UT (US)
Assigned to Ancestry.com DNA, LLC, Lehi, UT (US)
Filed by Ancestry.com DNA, LLC, Lehi, UT (US)
Filed on Jun. 28, 2024, as Appl. No. 18/759,587.
Claims priority of provisional application 63/511,084, filed on Jun. 29, 2023.
Prior Publication US 2025/0005108 A1, Jan. 2, 2025
Int. Cl. G06F 16/00 (2019.01); G06F 18/2415 (2023.01)
CPC G06F 18/2415 (2023.01) 20 Claims
OG exemplary drawing
 
1. A computer-implemented method for predicting data-source influences on a data manifestation of a named entity, the computer-implemented method comprising:
receiving an inheritance dataset of the named entity, the inheritance dataset comprising one or more reads at a plurality of data-bit regions;
determining a first portion and a second portion of the inheritance dataset of the named entity, the first portion inherited from a first data source, and the second portion inherited from a second data source;
identifying a subset of the data-bit regions that are associated with the data manifestation;
determining a first source-specific association score representing a first measurement of association between the first data source and the data manifestation, wherein the first source-specific association score is determined based on the first portion of the inheritance dataset at the identified subset of the data-bit regions;
determining a second source-specific association score representing a second measurement of association between the second data source and the data manifestation, wherein the second source-specific association score is determined based on the second portion of the inheritance dataset at the identified subset of the data-bit regions;
comparing the first source-specific association score to the second source-specific association score; and
identifying at least one of the first data source and the second data source as having a measure of influence on the data manifestation of the named entity.