Skip to main navigation Skip to search Skip to main content

New methods to identify high peak density artifacts in Fourier transform mass spectra and to mitigate their effects on high-throughput metabolomic data analysis

Research output: Contribution to journalArticlepeer-review

12 Scopus citations

Abstract

Introduction: Direct injection Fourier-transform mass spectrometry (FT-MS) allows for the high-throughput and high-resolution detection of thousands of metabolite-associated isotopologues. However, spectral artifacts can generate large numbers of spectral features (peaks) that do not correspond to known compounds. Misassignment of these artifactual features creates interpretive errors and limits our ability to discern the role of representative features within living systems. Objectives: Our goal is to develop rigorous methods that identify and handle spectral artifacts within the context of high-throughput FT-MS-based metabolomics studies. Results: We observed three types of artifacts unique to FT-MS that we named high peak density (HPD) sites: fuzzy sites, ringing and partial ringing. While ringing artifacts are well-known, fuzzy sites and partial ringing have not been previously well-characterized in the literature. We developed new computational methods based on comparisons of peak density within a spectrum to identify regions of spectra with fuzzy sites. We used these methods to identify and eliminate fuzzy site artifacts in an example dataset of paired cancer and non-cancer lung tissue samples and evaluated the impact of these artifacts on classification accuracy and robustness. Conclusion: Our methods robustly identified consistent fuzzy site artifacts in our FT-MS metabolomics spectral data. Without artifact identification and removal, 91.4% classification accuracy was achieved on an example lung cancer dataset; however, these classifiers rely heavily on artifactual features present in fuzzy sites. Proper removal of fuzzy site artifacts produces a more robust classifier based on non-artifactual features, with slightly improved accuracy of 92.4% in our example analysis.

Original languageEnglish
Article number125
JournalMetabolomics
Volume14
Issue number10
DOIs
StatePublished - Oct 1 2018

Bibliographical note

Publisher Copyright:
© 2018, The Author(s).

Funding

This work was supported in part by Grant NSF 1252893 (Moseley), NIH 1R03CA211835-01 (C.Wang, Flight), NIH 1R01ES022191-01 (Fan, Higashi, Moseley, Nantz) 1P01CA163223-01A1 (Lane, Fane), NIH 1U24DK097215-01A1 (Higashi, Fan, Lane, Moseley), NIH P30CA177558 (Evers), NIH UL1TR001998-01 (Kern), NIH 5P20GM121327-02 Pilot Project (Q. Wang), and American Heart Association AHA16GRNT31310020 (Q. Wang). In particular, we thank Woo-Young Kang who first detected fuzzy sites in our FT-MS spectra. We thank Wenzhu Zhang and Brian T. Chait in the National Resource for the Mass Spectrometric Analysis of Biological Macromolecule at the Rockefeller University, and Shruti Nayak, Avantika Dhabaria, and Beatrix Ueberheide at the Proteomics Resource Center at the New York University Langone Medical Center for providing us with spectra for our bioinformatics analyses. We thank Thomas Wilson and the High Resolution Metabolomics Laboratory (HRML) at the Institute of Biological, Environmental and Rural Sciences for Q-Exactive + data. We thank Timothy Fahrenholz for collection of direct infusion spectra on the Fusion. We thank Qiushi Sun for collection of ICMS spectra on the Fusion. Finally, we thank Mike Senko from Thermo Fisher Scientific for the extremely helpful discussions on the known and possible origins of the artifacts we have observed in Orbitrap FT-MS instruments. Acknowledgements This work was supported in part by Grant NSF 1252893 (Moseley), NIH 1R03CA211835-01 (C.Wang, Flight), NIH 1R01ES022191-01 (Fan, Higashi, Moseley, Nantz) 1P01CA163223-01A1 (Lane, Fane), NIH 1U24DK097215-01A1 (Higashi, Fan, Lane, Moseley), NIH P30CA177558 (Evers), NIH UL1TR001998-01 (Kern), NIH 5P20GM121327-02 Pilot Project (Q. Wang), and American Heart Association AHA16GRNT31310020 (Q. Wang). In particular, we thank Woo-Young Kang who first detected fuzzy sites in our FT-MS spectra. We thank Wenzhu Zhang and Brian T. Chait in the National Resource for the Mass Spectrometric Analysis of Biological Macromolecule at the Rockefeller University, and Shruti Nayak, Avantika Dhabaria, and Beatrix Ueberheide at the Proteomics Resource Center at the New York University Langone Medical Center for providing us with spectra for our bioinformatics analyses. We thank Thomas Wilson and the High Resolution Metabolomics Laboratory (HRML) at the Institute of Biological, Environmental and Rural Sciences for Q-Exactive + data. We thank Timothy Fahrenholz for collection of direct infusion spectra on the Fusion. We thank Qiushi Sun for collection of ICMS spectra on the Fusion. Finally, we thank Mike Senko from Thermo Fisher Scientific for the extremely helpful discussions on the known and possible origins of the artifacts we have observed in Orbitrap FT-MS instruments.

FundersFunder number
National Science Foundation Arctic Social Science Program1252893
National Institutes of Health (NIH)P30CA177558, UL1TR001998-01, 5P20GM121327-02, 1U24DK097215-01A1, 1R03CA211835-01, 1P01CA163223-01A1
National Institutes of Health/National Institute of Environmental Health SciencesR01ES022191
American the American Heart AssociationAHA16GRNT31310020

    UN SDGs

    This output contributes to the following UN Sustainable Development Goals (SDGs)

    1. SDG 3 - Good Health and Well-being
      SDG 3 Good Health and Well-being

    Keywords

    • Artifact
    • Data analysis
    • Fourier transform
    • Mass spectrometry
    • Metabolomics

    ASJC Scopus subject areas

    • Endocrinology, Diabetes and Metabolism
    • Biochemistry
    • Clinical Biochemistry

    Fingerprint

    Dive into the research topics of 'New methods to identify high peak density artifacts in Fourier transform mass spectra and to mitigate their effects on high-throughput metabolomic data analysis'. Together they form a unique fingerprint.

    Cite this