US 11,837,330 B2
Reference-guided genome sequencing
Justin Kinney, San Jose, CA (US)
Assigned to Western Digital Technologies, Inc., San Jose, CA (US)
Filed by Western Digital Technologies, Inc., San Jose, CA (US)
Filed on Mar. 18, 2020, as Appl. No. 16/822,010.
Prior Publication US 2021/0295946 A1, Sep. 23, 2021
This patent is subject to a terminal disclaimer.
Int. Cl. G16B 30/10 (2019.01); G16B 50/30 (2019.01); G16B 30/20 (2019.01); G06N 7/01 (2023.01); G16B 5/20 (2019.01)
CPC G16B 5/20 (2019.02) [G06N 7/01 (2023.01); G16B 30/10 (2019.02); G16B 50/30 (2019.02); G16B 30/20 (2019.02)] 21 Claims
OG exemplary drawing
 
1. A method of processing a plurality of sample reads for genome sequencing using at least one systolic array including a plurality of groups of cells, the method comprising:
for each sample read of the plurality of sample reads:
comparing substring sequences from the sample read to reference sequences representing different portions of a reference genome by storing the substring sequences in the plurality of groups of cells of the at least one systolic array, wherein each group of the plurality of groups of cells further stores a reference sequence for comparison to the sub string sequences;
identifying one or more stored reference sequences that match one or more of the compared substrings; and
determining a probabilistic location of the sample read within the reference genome based on the one or more identified reference sequences; and
partitioning the reference genome for referenced-aligned genome sequencing based at least in part on the determined probabilistic locations of the plurality of sample reads.