| CPC G16B 20/00 (2019.02) [G16B 30/00 (2019.02); G16B 50/00 (2019.02)] | 11 Claims |
|
1. A computer-implemented method for determining the genetic relatedness of multiple individuals of a particular species with fewer computational steps by a processor, comprising executing on a processor the steps of:
obtaining from a computer memory digital files of a first deoxyribonucleic acid (DNA) sequence information of a first individual and a second DNA sequence information of a second individual, each of said first and second DNA sequence information including base pair data for a set of single nucleotide polymorphism (SNP) loci;
said processor organizes the base pair data of each of said first and second DNA sequence information, wherein the base pair data present at each of said set of SNP loci are transformed into a standardized SNP data sequence format according to a digital SNP loci template having a pre-determined set of SNP loci, wherein a first digital file comprising a first standardized SNP sequence from said first DNA sequence and a second digital file comprising second standardized SNP sequence from said second DNA sequence are created, wherein each bit in said first standardized SNP sequence and said second standardized SNP sequence represents a base pair at one of said pre-determined set of SNP loci;
said processor divides the first standardized SNP sequence into a first set of digital data blocks and the second standardized SNP sequence into a second set of digital data blocks, each of said data blocks having a pre-determined bit length that is equal to a bit word length of said processor; and
said processor compares said first set of digital blocks with said second set of digital blocks,
wherein said first set of digital data blocks is aligned with said second set of digital data blocks according to said digital SNP loci template, a first predetermined sequence of digital data blocks of said first set of digital data blocks represents a first comparison segment and a first predetermined sequence of said second set of data blocks represents a second comparison segment, wherein the first comparison segment and said second comparison segment have a pre-determined segment length, and said processor performs the comparison of the first comparison segment and the second comparison segment in a stepwise block-by-block process beginning at a last digital data block of each of the first and second comparison segments, working backwards toward a first digital data block of each of the first and second comparison segments until
(1) a mismatch between the first and second comparison segments at an SNP locus is found or
(2) the entire lengths of the first and second comparison segments are compared with no mismatches at the SNP loci found,
wherein aligned digital data blocks are compared in a single computational step by the processor, and in the event a mismatch is identified in the comparison of the first comparison segment and said second comparison segment, a third comparison segment of said pre-determined segment length is identified starting from the last digital data block free from mismatches between the SNP loci of the first and second comparison segments and extending for said pre-determined segment length along said first set of digital data blocks, and a fourth comparison segment of said pre-determined segment length is identified starting from the last digital data block free from mismatches and extending for said pre-determined segment length along said second set of digital data blocks, and
said processor performing a comparison of the third comparison segment and the fourth comparison segment in a stepwise block-by-block process beginning at a last digital data block of each of the third and fourth comparison segments working backwards toward a first digital data block of each of the third and fourth comparison segments until
(1) a mismatch at an SNP locus between the third and fourth comparison segments is found or
(2) the entire lengths of the third and fourth comparison segments are compared with no mismatches at the SNP loci.
|