US 11,941,534 B2
Genome sequence alignment system and method
Gurpreet Singh Kalsi, Bangalore (IN); Anant V. Nori, Bangalore (IN); Christopher Justin Hughes, Santa Clara, CA (US); Sreenivas Subramoney, Bangalore (IN); and Damla Senol, Pittsburg, PA (US)
Assigned to Intel Corporation, Santa Clara, CA (US)
Filed by Intel Corporation, Santa Clara, CA (US)
Filed on Dec. 28, 2019, as Appl. No. 16/729,379.
Prior Publication US 2021/0201163 A1, Jul. 1, 2021
Int. Cl. G06N 3/123 (2023.01); G06F 9/30 (2018.01); G06F 15/78 (2006.01); G06F 15/80 (2006.01); G06F 40/45 (2020.01); G16B 30/00 (2019.01); G16B 30/10 (2019.01); G06F 17/00 (2019.01)
CPC G06N 3/123 (2013.01) [G06F 9/30036 (2013.01); G06F 15/78 (2013.01); G06F 15/8053 (2013.01); G06F 15/8061 (2013.01); G06F 40/45 (2020.01); G16B 30/00 (2019.02); G16B 30/10 (2019.02); G06F 17/00 (2013.01)] 20 Claims
 
1. A system, comprising:
a bit vector-based distance counter circuitry configured to generate one or more bit vectors encoded with information about potential matches and edits between a read and a reference genome, wherein the read comprises an encoding of a fragment of deoxyribonucleic acid (DNA) encoded via bases G, A, T, C; and
a bit vector-based traceback circuitry configured to divide the reference genome into one or more windows and to use the one or more bit vectors to generate a traceback output for each of the one or more windows, wherein the traceback output comprises a match, a substitution, an insert, a delete, or a combination thereof, between the read and the one or more windows.