Resumen
Rapidly growing volume of electrophysiological signals has been generated for clinical research in neurological disorders. European Data Format (EDF) is a standard format for storing electrophysiological signals. However, the bottleneck of existing signal analysis tools for handling large-scale datasets is the sequential way of loading large EDF files before performing signal analyses. To overcome this, we develop Hadoop-EDF, a distributed signal processing tool to load EDF data in a parallel manner using Hadoop MapReduce. Hadoop-EDF uses a robust data partition algorithm making EDF data parallelly processable. We evaluate Hadoop-EDF's scalability and performance by leveraging two datasets from the National Sleep Research Resource and running experiments on Amazon Web Service clusters. The performance of Hadoop-EDF on a 20-node cluster achieved about 26 times and 47 times faster than the sequential processing of 200 small-size files and 200 large-size files, respectively. The results demonstrate that Hadoop-EDF is more suitable and effective in processing large EDF files.
| Idioma original | English |
|---|---|
| Título de la publicación alojada | Proceedings - 2019 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2019 |
| Editores | Illhoi Yoo, Jinbo Bi, Xiaohua Tony Hu |
| Páginas | 2265-2271 |
| Número de páginas | 7 |
| ISBN (versión digital) | 9781728118673 |
| DOI | |
| Estado | Published - nov 2019 |
| Evento | 2019 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2019 - San Diego, United States Duración: nov 18 2019 → nov 21 2019 |
Serie de la publicación
| Nombre | Proceedings - 2019 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2019 |
|---|
Conference
| Conference | 2019 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2019 |
|---|---|
| País/Territorio | United States |
| Ciudad | San Diego |
| Período | 11/18/19 → 11/21/19 |
Nota bibliográfica
Publisher Copyright:© 2019 IEEE.
Financiación
This work was supported by the US National Institutes of Health under grants R24HL114473 and U01NS090408. Correspondence: [email protected]
| Financiadores | Número del financiador |
|---|---|
| National Institutes of Health (NIH) | U01NS090408, R24HL114473 |
| National Institutes of Health (NIH) |
ODS de las Naciones Unidas
Este resultado contribuye a los siguientes Objetivos de Desarrollo Sostenible
-
Good health and well being
ASJC Scopus subject areas
- Biochemistry
- Biotechnology
- Molecular Medicine
- Modeling and Simulation
- Health Informatics
- Pharmacology (medical)
- Public Health, Environmental and Occupational Health
Huella
Profundice en los temas de investigación de 'Hadoop-EDF: Large-scale Distributed Processing of Electrophysiological Signal Data in Hadoop MapReduce'. En conjunto forman una huella única.Citar esto
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver