Abstract
Introduction: Direct injection Fourier-transform mass spectrometry (FT-MS) allows for the high-throughput and high-resolution detection of thousands of metabolite-associated isotopologues. However, spectral artifacts can generate large numbers of spectral features (peaks) that do not correspond to known compounds. Misassignment of these artifactual features creates interpretive errors and limits our ability to discern the role of representative features within living systems. Objectives: Our goal is to develop rigorous methods that identify and handle spectral artifacts within the context of high-throughput FT-MS-based metabolomics studies. Results: We observed three types of artifacts unique to FT-MS that we named high peak density (HPD) sites: fuzzy sites, ringing and partial ringing. While ringing artifacts are well-known, fuzzy sites and partial ringing have not been previously well-characterized in the literature. We developed new computational methods based on comparisons of peak density within a spectrum to identify regions of spectra with fuzzy sites. We used these methods to identify and eliminate fuzzy site artifacts in an example dataset of paired cancer and non-cancer lung tissue samples and evaluated the impact of these artifacts on classification accuracy and robustness. Conclusion: Our methods robustly identified consistent fuzzy site artifacts in our FT-MS metabolomics spectral data. Without artifact identification and removal, 91.4% classification accuracy was achieved on an example lung cancer dataset; however, these classifiers rely heavily on artifactual features present in fuzzy sites. Proper removal of fuzzy site artifacts produces a more robust classifier based on non-artifactual features, with slightly improved accuracy of 92.4% in our example analysis.
| Original language | English |
|---|---|
| Article number | 125 |
| Journal | Metabolomics |
| Volume | 14 |
| Issue number | 10 |
| DOIs | |
| State | Published - Oct 1 2018 |
Bibliographical note
Publisher Copyright:© 2018, The Author(s).
Funding
This work was supported in part by Grant NSF 1252893 (Moseley), NIH 1R03CA211835-01 (C.Wang, Flight), NIH 1R01ES022191-01 (Fan, Higashi, Moseley, Nantz) 1P01CA163223-01A1 (Lane, Fane), NIH 1U24DK097215-01A1 (Higashi, Fan, Lane, Moseley), NIH P30CA177558 (Evers), NIH UL1TR001998-01 (Kern), NIH 5P20GM121327-02 Pilot Project (Q. Wang), and American Heart Association AHA16GRNT31310020 (Q. Wang). In particular, we thank Woo-Young Kang who first detected fuzzy sites in our FT-MS spectra. We thank Wenzhu Zhang and Brian T. Chait in the National Resource for the Mass Spectrometric Analysis of Biological Macromolecule at the Rockefeller University, and Shruti Nayak, Avantika Dhabaria, and Beatrix Ueberheide at the Proteomics Resource Center at the New York University Langone Medical Center for providing us with spectra for our bioinformatics analyses. We thank Thomas Wilson and the High Resolution Metabolomics Laboratory (HRML) at the Institute of Biological, Environmental and Rural Sciences for Q-Exactive + data. We thank Timothy Fahrenholz for collection of direct infusion spectra on the Fusion. We thank Qiushi Sun for collection of ICMS spectra on the Fusion. Finally, we thank Mike Senko from Thermo Fisher Scientific for the extremely helpful discussions on the known and possible origins of the artifacts we have observed in Orbitrap FT-MS instruments. Acknowledgements This work was supported in part by Grant NSF 1252893 (Moseley), NIH 1R03CA211835-01 (C.Wang, Flight), NIH 1R01ES022191-01 (Fan, Higashi, Moseley, Nantz) 1P01CA163223-01A1 (Lane, Fane), NIH 1U24DK097215-01A1 (Higashi, Fan, Lane, Moseley), NIH P30CA177558 (Evers), NIH UL1TR001998-01 (Kern), NIH 5P20GM121327-02 Pilot Project (Q. Wang), and American Heart Association AHA16GRNT31310020 (Q. Wang). In particular, we thank Woo-Young Kang who first detected fuzzy sites in our FT-MS spectra. We thank Wenzhu Zhang and Brian T. Chait in the National Resource for the Mass Spectrometric Analysis of Biological Macromolecule at the Rockefeller University, and Shruti Nayak, Avantika Dhabaria, and Beatrix Ueberheide at the Proteomics Resource Center at the New York University Langone Medical Center for providing us with spectra for our bioinformatics analyses. We thank Thomas Wilson and the High Resolution Metabolomics Laboratory (HRML) at the Institute of Biological, Environmental and Rural Sciences for Q-Exactive + data. We thank Timothy Fahrenholz for collection of direct infusion spectra on the Fusion. We thank Qiushi Sun for collection of ICMS spectra on the Fusion. Finally, we thank Mike Senko from Thermo Fisher Scientific for the extremely helpful discussions on the known and possible origins of the artifacts we have observed in Orbitrap FT-MS instruments.
| Funders | Funder number |
|---|---|
| National Science Foundation Arctic Social Science Program | 1252893 |
| National Institutes of Health (NIH) | P30CA177558, UL1TR001998-01, 5P20GM121327-02, 1U24DK097215-01A1, 1R03CA211835-01, 1P01CA163223-01A1 |
| National Institutes of Health/National Institute of Environmental Health Sciences | R01ES022191 |
| American the American Heart Association | AHA16GRNT31310020 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 3 Good Health and Well-being
Keywords
- Artifact
- Data analysis
- Fourier transform
- Mass spectrometry
- Metabolomics
ASJC Scopus subject areas
- Endocrinology, Diabetes and Metabolism
- Biochemistry
- Clinical Biochemistry
Fingerprint
Dive into the research topics of 'New methods to identify high peak density artifacts in Fourier transform mass spectra and to mitigate their effects on high-throughput metabolomic data analysis'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver