End-to-end Integration of Scientific Workflows on Distributed Cyberinfrastructures: Challenges and Lessons Learned with an Earth Science Application

Camila Roa, Mats Rynge, Paula Olaya, Karan Vahi, Todd Miller, James Griffioen, Shelley Knuth, John Goodhue, David Hudak, Alana Romanella, Ricardo Llamas, Rodrigo Vargas, Miron Livny, Ewa Deelman, Michela Taufer

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Distributed cyberinfrastructures (CI) pose opportunities and challenges for the execution of scientific workflows, especially in the context of Earth science applications. They provide heterogeneous resources that can meet the needs of the applications that are part of the scientific workflows and provide the necessary performance and scalability to achieve scientific goals. However, the challenge with distributed CI is that it is difficult to find the right resources for the applications and to orchestrate the workflow execution from resource provisioning to job execution to delivering the final results. In some cases, poor choice of resources may result in slow execution or outright failure. In this paper, we present Advanced Cyberinfrastructure Coordination Ecosystem: Services & Support (ACCESS) Pegasus, a CI solution built as part of the U.S. National Science Foundation ACCESS program that provides automated execution of scientific applications. We demonstrate Pegasus's capabilities with SOil MOisture SPatial Inference Engine (SOMOSPIE), an earth science multi-component application for fine-grained soil moisture predictions. We identify a roadmap to migrate applications such as SOMOSPIE on ACCESS resources with the support of ACCESS Pegasus, outlining both strengths and weaknesses of this approach.

Original languageEnglish
Title of host publication16th IEEE/ACM International Conference on Utility and Cloud Computing, UCC 2023
ISBN (Electronic)9798400702341
DOIs
StatePublished - Dec 4 2023
Event16th IEEE/ACM International Conference on Utility and Cloud Computing, UCC 2023 - Taormina, Italy
Duration: Dec 4 2023Dec 7 2023

Publication series

Name16th IEEE/ACM International Conference on Utility and Cloud Computing, UCC 2023

Conference

Conference16th IEEE/ACM International Conference on Utility and Cloud Computing, UCC 2023
Country/TerritoryItaly
CityTaormina
Period12/4/2312/7/23

Bibliographical note

Publisher Copyright:
© 2023 Copyright is held by the owner/author(s). Publication rights licensed to ACM.

Keywords

  • containers
  • high throughput computing
  • machine learning
  • soil moisture
  • workflows

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Hardware and Architecture

Fingerprint

Dive into the research topics of 'End-to-end Integration of Scientific Workflows on Distributed Cyberinfrastructures: Challenges and Lessons Learned with an Earth Science Application'. Together they form a unique fingerprint.

Cite this