Ir directamente a la navegación principal Ir directamente a la búsqueda Ir directamente al contenido principal

A probabilistic framework for aligning paired-end RNA-seq data

  • Yin Hu
  • , Kai Wang
  • , Xiaping He
  • , Derek Y. Chiang
  • , Jan F. Prins
  • , Jinze Liu

Producción científica: Articlerevisión exhaustiva

20 Citas (Scopus)

Resumen

Motivation: The RNA-seq paired-end read (PER) protocol samples transcript fragments longer than the sequencing capability of today's technology by sequencing just the two ends of each fragment. Deep sampling of the transcriptome using the PER protocol presents the opportunity to reconstruct the unsequenced portion of each transcript fragment using end reads from overlapping PERs, guided by the expected length of the fragment.Methods: A probabilistic framework is described to predict the alignment to the genome of all PER transcript fragments in a PER dataset. Starting from possible exonic and spliced alignments of all end reads, our method constructs potential splicing paths connecting paired ends. An expectation maximization method assigns likelihood values to all splice junctions and assigns the most probable alignment for each transcript fragment. Results: The method was applied to 2 × 35 bp PER datasets from cancer cell lines MCF-7 and SUM-102. PER fragment alignment increased the coverage 3-fold compared to the alignment of the end reads alone, and increased the accuracy of splice detection. The accuracy of the expectation maximization (EM) algorithm in the presence of alternative paths in the splice graph was validated by qRT-PCR experiments on eight exon skipping alternative splicing events. PER fragment alignment with long-range splicing confirmed 8 out of 10 fusion events identified in the MCF-7 cell line in an earlier study by (Maher et al., 2009).

Idioma originalEnglish
Número de artículobtq336
Páginas (desde-hasta)1950-1957
Número de páginas8
PublicaciónBioinformatics
Volumen26
N.º16
DOI
EstadoPublished - jun 23 2010

Nota bibliográfica

Funding Information:
Around 25% of junctions are primarily supported by PER fragments, while only around 7% of junctions gain substantial support from single end reads. Furthermore, the majority of the junctions (>67%), corresponding to points, have PER support 3-fold higher than single end reads.

Financiación

Around 25% of junctions are primarily supported by PER fragments, while only around 7% of junctions gain substantial support from single end reads. Furthermore, the majority of the junctions (>67%), corresponding to points, have PER support 3-fold higher than single end reads.

FinanciadoresNúmero del financiador
U.S. Department of Energy Chinese Academy of Sciences Guangzhou Municipal Science and Technology Project Oak Ridge National Laboratory Extreme Science and Engineering Discovery Environment National Science Foundation National Energy Research Scientific Computing Center National Natural Science Foundation of China0850237
U.S. Department of Energy Chinese Academy of Sciences Guangzhou Municipal Science and Technology Project Oak Ridge National Laboratory Extreme Science and Engineering Discovery Environment National Science Foundation National Energy Research Scientific Computing Center National Natural Science Foundation of China
National Institutes of Health (NIH)P20RR016481
National Institutes of Health (NIH)
National Childhood Cancer Registry – National Cancer InstituteU24CA143848
National Childhood Cancer Registry – National Cancer Institute
Alfred P Sloan Foundation

    ODS de las Naciones Unidas

    Este resultado contribuye a los siguientes Objetivos de Desarrollo Sostenible

    1. Good health and well being
      Good health and well being

    ASJC Scopus subject areas

    • Statistics and Probability
    • Biochemistry
    • Molecular Biology
    • Computer Science Applications
    • Computational Theory and Mathematics
    • Computational Mathematics

    Huella

    Profundice en los temas de investigación de 'A probabilistic framework for aligning paired-end RNA-seq data'. En conjunto forman una huella única.

    Citar esto