An enhanced model-based checkpointing protocol

Jiang Wu, Yi Luo, D. Manivannan

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

3 Scopus citations

Abstract

Checkpointing and rollback recovery are widely used techniques to handle failures in distributed computing systems. Usually we avoid taking checkpoints that are useless during the recovery process. Communication-Induced checkpointing algorithms guarantee the usefulness of all the checkpoints and provide considerable autonomy with relatively low overhead. In this paper, we propose an enhanced Communication-Induced checkpointing algorithm. Our algorithm is likely to have less checkpointing overhead than an existing algorithm in the literature.

Original language: English
Title of host publication: Proceedings of the IASTED International Conference on Parallel and Distributed Computing and Networks, PDCN 2007
Pages: 332-337
Number of pages: 6
State: Published - 2007
Event: IASTED International Conference on Parallel and Distributed Computing and Networks, PDCN 2007 - Innsbruck, Austria
Duration: Feb 13 2007 - Feb 15 2007

Publication series

Name: Proceedings of the IASTED International Conference on Parallel and Distributed Computing and Systems
ISSN (Print): 1027-2658

Conference

Conference: IASTED International Conference on Parallel and Distributed Computing and Networks, PDCN 2007
Country/Territory: Austria
City: Innsbruck
Period: 2/13/07 - 2/15/07

Keywords

  • Checkpointing
  • Recovery
  • Useless checkpoints

ASJC Scopus subject areas

  • Software
  • Hardware and Architecture
  • Computer Networks and Communications
