US 12,106,826 B2
	Methods and systems for detecting sequence variants
Deniz Kural, Somerville, MA (US)
Assigned to Seven Bridges Genomics Inc., Charlestown, MA (US)
Filed by Seven Bridges Genomics Inc., Charlestown, MA (US)
Filed on Oct. 25, 2023, as Appl. No. 18/494,317.
Application 18/494,317 is a continuation of application No. 17/933,260, filed on Sep. 19, 2022, granted, now 11,837,328.
Application 17/933,260 is a continuation of application No. 16/443,402, filed on Jun. 17, 2019, granted, now 11,488,688, issued on Nov. 1, 2022.
Application 16/443,402 is a continuation of application No. 15/906,404, filed on Feb. 27, 2018, granted, now 10,325,675, issued on Jun. 18, 2019.
Application 15/906,404 is a continuation of application No. 15/196,345, filed on Jun. 29, 2016, granted, now 9,904,763, issued on Feb. 27, 2018.
Application 15/196,345 is a continuation of application No. 14/811,057, filed on Jul. 28, 2015, granted, now 9,390,226, issued on Jul. 12, 2016.
Application 14/811,057 is a continuation of application No. 14/041,850, filed on Sep. 30, 2013, granted, now 9,116,866, issued on Aug. 25, 2015.
Claims priority of provisional application 61/884,380, filed on Sep. 30, 2013.
Claims priority of provisional application 61/868,249, filed on Aug. 21, 2013.
Prior Publication US 2024/0062850 A1, Feb. 22, 2024
Int. Cl. G16B 30/10 (2019.01); G16B 30/00 (2019.01); G16B 30/20 (2019.01); G16B 50/00 (2019.01)

CPC G16B 30/10 (2019.02) [G16B 30/00 (2019.02); G16B 30/20 (2019.02); G16B 50/00 (2019.02)]

20 Claims

1. A method for aligning one or more sequence reads to a genomic reference graph, the one or more sequence reads having been previously obtained from a biological sample from a subject, the method comprising:

using at least one processor to perform:

accessing at least one data structure representing the genomic reference graph, the genomic reference graph representing at least 1,000,000 nucleic acids and comprising nodes and edges connecting the nodes, the nodes including a first node and one or more parent nodes of the first node, the first node representing a first nucleotide sequence stored as a first string of symbols, wherein the at least one data structure stores data specifying the nodes and edges;

aligning the one or more sequence reads to the genomic reference graph using the at least one data structure and a dynamic programming algorithm, the aligning comprising, for each particular sequence read of the one or more sequence reads:

determining scores for entries in a first matrix associated with the first node, the first matrix representing a comparison between the particular sequence read and the first string of symbols, the determining comprising:

determining whether a symbol of the particular sequence read matches a first symbol of the first string of symbols;

accessing a score from one or more matrices associated with the one or more parent nodes of the first node; and

determining a score for an entry in the first matrix based on: (i) a result of determining whether the symbol of the particular sequence read matches the first symbol of the first string of symbols and (ii) the score accessed from the one or more matrices associated with the one or more parent nodes; and

aligning the particular sequence read to the genomic reference graph based on the determined scores; and

generating output indicative of results of aligning the one or more sequence reads to the genomic reference graph.