Characterization and Detection of Artifacts for Error-Controlled Lossy Compressors

Pu Jiao, Sheng Di, Jinyang Liu, Xin Liang, Franck Cappello

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Today's scientific high-performance computing (HPC) applications are often running on large-scale environments, producing extremely large volumes of data that need to be compressed effectively for efficient storage or data transfer. Error-bounded lossy compression is arguably the most efficient way to this end, because it can get very high compression ratios while controlling the data distortion strictly based on user requirements for compression errors. However, error-bounded lossy compressors may have serious artifact issues in situations with relatively large error bound or high compression ratios, which is highly undesirable to users. In this paper, we compre-hensively characterize the artifacts for multiple state-of-the-art error-bounded lossy compressors (including SZ-1.4, SZ-2.1, SZ-3.0, FPZIP, ZFP, MGARD) and provide an in-depth analysis for the root cause of these artifacts. We summarize the artifact issue into three types and also develop an efficient artifact detection algorithm for each type of artifact. We finally evaluate our artifact detection methods using four scientific datasets, which demonstrates that the proposed methods are able to detect artifact issues under linear time complexity.

Original languageEnglish
Title of host publicationProceedings - 2023 IEEE 30th International Conference on High Performance Computing, Data, and Analytics, HiPC 2023
Pages117-126
Number of pages10
ISBN (Electronic)9798350383225
DOIs
StatePublished - 2023
Event30th Annual IEEE International Conference on High Performance Computing, Data, and Analytics, HiPC 2023 - Goa, India
Duration: Dec 18 2023Dec 21 2023

Publication series

NameProceedings - 2023 IEEE 30th International Conference on High Performance Computing, Data, and Analytics, HiPC 2023

Conference

Conference30th Annual IEEE International Conference on High Performance Computing, Data, and Analytics, HiPC 2023
Country/TerritoryIndia
CityGoa
Period12/18/2312/21/23

Bibliographical note

Publisher Copyright:
© 2023 IEEE.

Keywords

  • High-performance computing
  • compression artifacts
  • lossy compression
  • scientific data

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications
  • Computer Science Applications
  • Hardware and Architecture
  • Information Systems
  • Information Systems and Management

Fingerprint

Dive into the research topics of 'Characterization and Detection of Artifacts for Error-Controlled Lossy Compressors'. Together they form a unique fingerprint.

Cite this