TY - JOUR
T1 - A linear-time algorithm for finding a maximum-length ORF in a splice graph
AU - Jaromczyk, Jerzy W.
AU - Moore, Neil
AU - Schardl, Christopher L.
PY - 2012/9
Y1 - 2012/9
N2 - We present a linear-time, deterministic algorithm for finding a longest Open Reading Frame (ORF) in an alternatively spliced gene represented by a splice graph. Finding protein-encoding regions is a fundamental problem in genomic and transcriptomic analysis, and in some circumstances long ORFs can provide good predictions of such regions. Splice graphs are a common way of compactly representing what may be exponentially many alternative splicings of a sequence. The efficiency of our algorithm is achieved by pruning the search space so as to bound the number of reading frames considered at any vertex of the splice graph. The algorithm guarantees that the unpruned reading frames contain at least one longest ORF of the gene. We are therefore able to find a longest ORF among all splice variants in time linear in the size of the splice graph, even though the number of potential transcripts may be much larger.
AB - We present a linear-time, deterministic algorithm for finding a longest Open Reading Frame (ORF) in an alternatively spliced gene represented by a splice graph. Finding protein-encoding regions is a fundamental problem in genomic and transcriptomic analysis, and in some circumstances long ORFs can provide good predictions of such regions. Splice graphs are a common way of compactly representing what may be exponentially many alternative splicings of a sequence. The efficiency of our algorithm is achieved by pruning the search space so as to bound the number of reading frames considered at any vertex of the splice graph. The algorithm guarantees that the unpruned reading frames contain at least one longest ORF of the gene. We are therefore able to find a longest ORF among all splice variants in time linear in the size of the splice graph, even though the number of potential transcripts may be much larger.
KW - Alternative splicing
KW - Molecular sequence analysis
KW - ORF
KW - Open reading frames
KW - Splice graphs
UR - http://www.scopus.com/inward/record.url?scp=84866665391&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84866665391&partnerID=8YFLogxK
U2 - 10.1504/IJCBDD.2012.049212
DO - 10.1504/IJCBDD.2012.049212
M3 - Article
C2 - 23013654
AN - SCOPUS:84866665391
SN - 1756-0756
VL - 5
SP - 284
EP - 297
JO - International Journal of Computational Biology and Drug Design
JF - International Journal of Computational Biology and Drug Design
IS - 3-4
ER -