Checkpointing with multicast communication

James E. Lumpp, William R. Dieter

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

For long-running or large-scale distributed programs, the ability to provide software fault tolerance via checkpointing is valuable. For scalable systems, multicast communication is becoming a predominant communication paradigm. While some aspects of consistency and channel state are the same for both unicast and multicast protocols, the implementations of checkpointing systems differ. This paper explores the problem of checkpointing in a multicast environment and introduces two checkpointing algorithms for such environments. The first algorithm is closely based on existing checkpointing algorithms. The second employs the multicast protocol to distribute checkpointing information efficiently.

Original language: English
Title of host publication: 1998 IEEE Aerospace Conference, AERO 1998 - Proceedings
Pages: 467-479
Number of pages: 13
State: Published - 1998
Event: 1998 IEEE Aerospace Conference, AERO 1998 - Snowmass, United States
Duration: Mar 28 1998 → Mar 28 1998

Publication series

Name: IEEE Aerospace Conference Proceedings
Volume: 4
ISSN (Print): 1095-323X

Conference

Conference: 1998 IEEE Aerospace Conference, AERO 1998
Country/Territory: United States
City: Snowmass
Period: 3/28/98 → 3/28/98

Bibliographical note

Publisher Copyright:
© 1998 IEEE.

ASJC Scopus subject areas

  • Aerospace Engineering
  • Space and Planetary Science
