US 12,224,042 B2
Devices and methods for genome sequencing
Wen Ma, Sunnyvale, CA (US); Tung Thanh Hoang, San Jose, CA (US); Daniel Bedau, San Jose, CA (US); and Justin Kinney, San Jose, CA (US)
Assigned to Sandisk Technologies, Inc., Milpitas, CA (US)
Filed by Western Digital Technologies, Inc., San Jose, CA (US)
Filed on Jun. 22, 2020, as Appl. No. 16/908,581.
Prior Publication US 2021/0398618 A1, Dec. 23, 2021
Int. Cl. G16B 40/20 (2019.01); C12Q 1/6869 (2018.01); G16B 30/10 (2019.01)
CPC G16B 40/20 (2019.02) [C12Q 1/6869 (2013.01); G16B 30/10 (2019.02)] 26 Claims
OG exemplary drawing
 
1. A device, comprising:
a plurality of arrays of Non-Volatile Memory (NVM) cells; and
circuitry configured to:
store reference sequences in respective groups of NVM cells of the plurality of arrays, each reference sequence representing a portion of a genome;
load one or more exact matching phase substring sequences into groups of NVM cells of the plurality of arrays for comparison to reference sequences stored in the groups of NVM cells, the one or more exact matching phase substring sequences representing portions of at least one sample read;
identify, in an exact matching phase, one or more groups of NVM cells in the plurality of arrays where the stored reference sequence matches the loaded exact matching phase substring sequence using the plurality of arrays as Content Addressable Memories (CAMs), wherein none of the compared reference sequences and the one or more exact matching phase substring sequences include wildcard values when using the plurality of arrays as CAMs in the exact matching phase;
load one or more approximate matching phase substring sequences into groups of NVM cells of the plurality of arrays for comparison to reference sequences stored in the groups of NVM cells, the one or more approximate matching phase substring sequences representing portions of the at least one sample read;
identify, in an approximate matching phase, one or more groups of NVM cells in the plurality of arrays where the stored reference sequence approximately matches the loaded approximate matching phase substring sequence using the plurality of arrays as Ternary CAMs (TCAMs), wherein at least one of the compared reference sequence and the approximate matching phase substring sequence for each group of NVM cells includes at least one wildcard value when using the plurality of arrays as TCAMs in the approximate matching phase; and
based on an indication of one or more identified groups of NVM cells from a first array of the plurality of arrays where one or more stored reference sequences match or approximately match a first substring sequence, perform at least one of:
store one or more additional reference sequences in a second array of the plurality of arrays; and
load at least one additional substring sequence into the second array.