CPC G16B 5/20 (2019.02) [G06N 7/01 (2023.01); G16B 30/10 (2019.02); G16B 50/30 (2019.02); G16B 30/20 (2019.02)] | 21 Claims |
1. A method of processing a plurality of sample reads for genome sequencing using at least one systolic array including a plurality of groups of cells, the method comprising:
for each sample read of the plurality of sample reads:
comparing substring sequences from the sample read to reference sequences representing different portions of a reference genome by storing the substring sequences in the plurality of groups of cells of the at least one systolic array, wherein each group of the plurality of groups of cells further stores a reference sequence for comparison to the sub string sequences;
identifying one or more stored reference sequences that match one or more of the compared substrings; and
determining a probabilistic location of the sample read within the reference genome based on the one or more identified reference sequences; and
partitioning the reference genome for referenced-aligned genome sequencing based at least in part on the determined probabilistic locations of the plurality of sample reads.
|