TspSZ: An Efficient Parallel Error-Bounded Lossy Compressor for Topological Skeleton Preservation

Mingze Xia, Bei Wang, Yuxiao Li, Pu Jiao, Xin Liang, Hanqi Guo

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Data compression is a powerful solution for addressing big data challenges in database and data management. In scientific data compression for vector fields, preserving topological information is essential for accurate analysis and visualization. The topological skeleton, a fundamental component of vector field topology, consists of critical points and their connectivity (i.e., separatrices). While previous work has focused on preserving critical points in error-controlled lossy compression, little attention has been given to preserving separatrices, which are equally important. In this work, we introduce TspSZ, an efficient error-bounded lossy compression framework designed to preserve both critical points and separatrices. Our key contributions are threefold. First, we propose TspSZ, a topological-skeleton-preserving lossy compression framework that integrates two algorithms, enabling existing critical-point-preserving compressors to also retain separatrices, significantly enhancing their topology preservation capabilities. Second, we optimize TspSZ for efficiency through tailored improvements and parallelization. Specifically, we introduce a new error control mechanism to achieve high compression ratios and implement a shared-memory parallelization strategy to boost compression throughput. Third, we evaluate TspSZ against state-of-the-art lossy and lossless compressors using four real-world scientific datasets. Experimental results show that TspSZ achieves compression ratios of up to 7.7× while effectively preserving the topological skeleton, ensuring efficient storage and transmission of scientific data without compromising topological integrity.

Original languageEnglish
Title of host publicationProceedings - 2025 IEEE 41st International Conference on Data Engineering, ICDE 2025
Pages3682-3695
Number of pages14
ISBN (Electronic)9798331536039
DOIs
StatePublished - 2025
Event41st IEEE International Conference on Data Engineering, ICDE 2025 - Hong Kong, China
Duration: May 19 2025May 23 2025

Publication series

NameProceedings - International Conference on Data Engineering
ISSN (Print)1084-4627
ISSN (Electronic)2375-0286

Conference

Conference41st IEEE International Conference on Data Engineering, ICDE 2025
Country/TerritoryChina
CityHong Kong
Period5/19/255/23/25

Bibliographical note

Publisher Copyright:
© 2025 IEEE.

Funding

This work was partially supported by grants from NSF OAC-2330367, OAC-2311756, OAC-2313122, OAC-2313123, and OAC-2313124. We would like to thank the University of Kentucky Center for Computational Sciences and Information Technology Services Research Computing for its support and use of the Lipscomb Compute Cluster, Morgan Compute Cluster, and associated research computing resources.

FundersFunder number
Kentucky Transportation Center, University of Kentucky
National Science Foundation Arctic Social Science ProgramOAC-2330367, OAC-2311756, OAC-2313124, OAC-2313123, OAC-2313122

    Keywords

    • high-performance computing
    • lossy compression
    • Scientific data management
    • vector field topology

    ASJC Scopus subject areas

    • Software
    • Signal Processing
    • Information Systems

    Fingerprint

    Dive into the research topics of 'TspSZ: An Efficient Parallel Error-Bounded Lossy Compressor for Topological Skeleton Preservation'. Together they form a unique fingerprint.

    Cite this