Ir directamente a la navegación principal Ir directamente a la búsqueda Ir directamente al contenido principal

New methods to identify high peak density artifacts in Fourier transform mass spectra and to mitigate their effects on high-throughput metabolomic data analysis

Producción científica: Articlerevisión exhaustiva

12 Citas (Scopus)

Resumen

Introduction: Direct injection Fourier-transform mass spectrometry (FT-MS) allows for the high-throughput and high-resolution detection of thousands of metabolite-associated isotopologues. However, spectral artifacts can generate large numbers of spectral features (peaks) that do not correspond to known compounds. Misassignment of these artifactual features creates interpretive errors and limits our ability to discern the role of representative features within living systems. Objectives: Our goal is to develop rigorous methods that identify and handle spectral artifacts within the context of high-throughput FT-MS-based metabolomics studies. Results: We observed three types of artifacts unique to FT-MS that we named high peak density (HPD) sites: fuzzy sites, ringing and partial ringing. While ringing artifacts are well-known, fuzzy sites and partial ringing have not been previously well-characterized in the literature. We developed new computational methods based on comparisons of peak density within a spectrum to identify regions of spectra with fuzzy sites. We used these methods to identify and eliminate fuzzy site artifacts in an example dataset of paired cancer and non-cancer lung tissue samples and evaluated the impact of these artifacts on classification accuracy and robustness. Conclusion: Our methods robustly identified consistent fuzzy site artifacts in our FT-MS metabolomics spectral data. Without artifact identification and removal, 91.4% classification accuracy was achieved on an example lung cancer dataset; however, these classifiers rely heavily on artifactual features present in fuzzy sites. Proper removal of fuzzy site artifacts produces a more robust classifier based on non-artifactual features, with slightly improved accuracy of 92.4% in our example analysis.

Idioma originalEnglish
Número de artículo125
PublicaciónMetabolomics
Volumen14
N.º10
DOI
EstadoPublished - oct 1 2018

Nota bibliográfica

Publisher Copyright:
© 2018, The Author(s).

Financiación

This work was supported in part by Grant NSF 1252893 (Moseley), NIH 1R03CA211835-01 (C.Wang, Flight), NIH 1R01ES022191-01 (Fan, Higashi, Moseley, Nantz) 1P01CA163223-01A1 (Lane, Fane), NIH 1U24DK097215-01A1 (Higashi, Fan, Lane, Moseley), NIH P30CA177558 (Evers), NIH UL1TR001998-01 (Kern), NIH 5P20GM121327-02 Pilot Project (Q. Wang), and American Heart Association AHA16GRNT31310020 (Q. Wang). In particular, we thank Woo-Young Kang who first detected fuzzy sites in our FT-MS spectra. We thank Wenzhu Zhang and Brian T. Chait in the National Resource for the Mass Spectrometric Analysis of Biological Macromolecule at the Rockefeller University, and Shruti Nayak, Avantika Dhabaria, and Beatrix Ueberheide at the Proteomics Resource Center at the New York University Langone Medical Center for providing us with spectra for our bioinformatics analyses. We thank Thomas Wilson and the High Resolution Metabolomics Laboratory (HRML) at the Institute of Biological, Environmental and Rural Sciences for Q-Exactive + data. We thank Timothy Fahrenholz for collection of direct infusion spectra on the Fusion. We thank Qiushi Sun for collection of ICMS spectra on the Fusion. Finally, we thank Mike Senko from Thermo Fisher Scientific for the extremely helpful discussions on the known and possible origins of the artifacts we have observed in Orbitrap FT-MS instruments. Acknowledgements This work was supported in part by Grant NSF 1252893 (Moseley), NIH 1R03CA211835-01 (C.Wang, Flight), NIH 1R01ES022191-01 (Fan, Higashi, Moseley, Nantz) 1P01CA163223-01A1 (Lane, Fane), NIH 1U24DK097215-01A1 (Higashi, Fan, Lane, Moseley), NIH P30CA177558 (Evers), NIH UL1TR001998-01 (Kern), NIH 5P20GM121327-02 Pilot Project (Q. Wang), and American Heart Association AHA16GRNT31310020 (Q. Wang). In particular, we thank Woo-Young Kang who first detected fuzzy sites in our FT-MS spectra. We thank Wenzhu Zhang and Brian T. Chait in the National Resource for the Mass Spectrometric Analysis of Biological Macromolecule at the Rockefeller University, and Shruti Nayak, Avantika Dhabaria, and Beatrix Ueberheide at the Proteomics Resource Center at the New York University Langone Medical Center for providing us with spectra for our bioinformatics analyses. We thank Thomas Wilson and the High Resolution Metabolomics Laboratory (HRML) at the Institute of Biological, Environmental and Rural Sciences for Q-Exactive + data. We thank Timothy Fahrenholz for collection of direct infusion spectra on the Fusion. We thank Qiushi Sun for collection of ICMS spectra on the Fusion. Finally, we thank Mike Senko from Thermo Fisher Scientific for the extremely helpful discussions on the known and possible origins of the artifacts we have observed in Orbitrap FT-MS instruments.

FinanciadoresNúmero del financiador
National Science Foundation Arctic Social Science Program1252893
National Institutes of Health (NIH)P30CA177558, UL1TR001998-01, 5P20GM121327-02, 1U24DK097215-01A1, 1R03CA211835-01, 1P01CA163223-01A1
National Institutes of Health/National Institute of Environmental Health SciencesR01ES022191
American the American Heart AssociationAHA16GRNT31310020

    ODS de las Naciones Unidas

    Este resultado contribuye a los siguientes Objetivos de Desarrollo Sostenible

    1. Good health and well being
      Good health and well being

    ASJC Scopus subject areas

    • Endocrinology, Diabetes and Metabolism
    • Biochemistry
    • Clinical Biochemistry

    Huella

    Profundice en los temas de investigación de 'New methods to identify high peak density artifacts in Fourier transform mass spectra and to mitigate their effects on high-throughput metabolomic data analysis'. En conjunto forman una huella única.

    Citar esto