Abstract
In recent years, the FAIR guiding principles and the broader concept of open science has grown in importance in academic research, especially as funding entities have aggressively promoted public sharing of research products. Key to public research sharing is deposition of datasets into online data repositories, but it can be a chore to transform messy unstructured data into the forms required by these repositories. To help generate Metabolomics Workbench depositions, we have developed the MESSES (Metadata from Experimental SpreadSheets Extraction System) software package, implemented in the Python 3 programming language and supported on Linux, Windows, and Mac operating systems. MESSES helps transform tabular data from multiple sources into a Metabolomics Workbench specific deposition format. The package provides three commands, extract, validate, and convert, that implement a natural data transformation workflow. Moreover, MESSES facilitates richer metadata capture than is typically attempted by manual efforts. The source code and extensive documentation is hosted on GitHub and is also available on the Python Package Index for easy installation.
| Original language | English |
|---|---|
| Article number | 842 |
| Journal | Metabolites |
| Volume | 13 |
| Issue number | 7 |
| DOIs | |
| State | Published - Jul 2023 |
Bibliographical note
Publisher Copyright:© 2023 by the authors.
Funding
The research was funded by the National Institutes of Health, grant number P42 ES007380 (University of Kentucky Superfund Research Program Grant; PI Pennell), and by the National Science Foundation, grant number 2020026 (PI Moseley). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institute of Environmental Health Sciences nor the National Science Foundation.
| Funders | Funder number |
|---|---|
| National Science Foundation Arctic Social Science Program | 2020026 |
| National Science Foundation Arctic Social Science Program | |
| National Institutes of Health (NIH) | P42 ES007380 |
| National Institutes of Health (NIH) | |
| University of Kentucky |
Keywords
- Metabolomics Workbench
- Python programming language
- data sharing
- data transformation
- dataset deposition
- metadata capture
ASJC Scopus subject areas
- Endocrinology, Diabetes and Metabolism
- Biochemistry
- Molecular Biology