FINE: A Fully Informed aNd Efficient communication-induced checkpointing protocol for distributed systems

Research output: Contribution to journalArticlepeer-review

28 Scopus citations

Abstract

Communication-Induced Checkpointing (CIC) protocols are classified into two categories in the literature: Index-based and Model-based. In this paper, we discuss two data structures being used in these two kinds of CIC protocols, and their different roles in helping the checkpointing algorithms to enforce Z-cycle Free (ZCF) property. Then, we present our Fully Informed aNd Efficient (FINE) communication-induced checkpointing algorithm, which not only has less checkpointing overhead than the well-known Fully Informed (FI) CIC protocol proposed by Helary et al. but also has less message overhead. Performance evaluation indicates that our protocol performs better than many of the other existing CIC protocols.

Original languageEnglish
Pages (from-to)153-167
Number of pages15
JournalJournal of Parallel and Distributed Computing
Volume69
Issue number2
DOIs
StatePublished - Feb 2009

Bibliographical note

Funding Information:
A preliminary version of this paper [15] has been presented in the International Conference on Systems (ICONS’08). This material is based in part upon work supported by the US National science Foundation under Grant No. IIS-0414791. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the National Science Foundation.

Funding Information:
D. Manivannan is currently an associate professor in the Department of Computer Science at the University of Kentucky, Lexington, USA. Dr. Manivannan received the Ph.D. degree in Computer and Information Science from The Ohio State University, Columbus, Ohio, in 1997. His areas of research interests include fault-tolerance and synchronization in distributed systems, routing in wormhole networks, channel allocation in cellular networks and wireless networks. He served as program committee member for several International Conferences and also served as reviewer for several International Journals published by ACM, IEEE, Elsevier, Springer, Oxford University Press and others. He is on the Editorial Board of Information Sciences journal, published by Elsevier and Wireless Personal Communications journal, published by Springer. He is also an Associate Technical Editor for IEEE Communications Magazine. Information about his publications and research can be found at his lab site http://dmcslab.netlab.uky.edu/ . Dr. Manivannan is a recipient of the CAREER Award from the US National Science Foundation. He is a senior member of the ACM, and a member of ACM SIGMOBILE, IEEE, and IEEE Computer Society.

Keywords

  • Communication-induced checkpointing protocols
  • Consistent global checkpoints
  • Distributed systems

ASJC Scopus subject areas

  • Software
  • Theoretical Computer Science
  • Hardware and Architecture
  • Computer Networks and Communications
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'FINE: A Fully Informed aNd Efficient communication-induced checkpointing protocol for distributed systems'. Together they form a unique fingerprint.

Cite this