Abstract
Data compression is a powerful solution for addressing big data challenges in database and data management. In scientific data compression for vector fields, preserving topological information is essential for accurate analysis and visualization. The topological skeleton, a fundamental component of vector field topology, consists of critical points and their connectivity (i.e., separatrices). While previous work has focused on preserving critical points in error-controlled lossy compression, little attention has been given to preserving separatrices, which are equally important. In this work, we introduce TspSZ, an efficient error-bounded lossy compression framework designed to preserve both critical points and separatrices. Our key contributions are threefold. First, we propose TspSZ, a topological-skeleton-preserving lossy compression framework that integrates two algorithms, enabling existing critical-point-preserving compressors to also retain separatrices, significantly enhancing their topology preservation capabilities. Second, we optimize TspSZ for efficiency through tailored improvements and parallelization. Specifically, we introduce a new error control mechanism to achieve high compression ratios and implement a shared-memory parallelization strategy to boost compression throughput. Third, we evaluate TspSZ against state-of-the-art lossy and lossless compressors using four real-world scientific datasets. Experimental results show that TspSZ achieves compression ratios of up to 7.7× while effectively preserving the topological skeleton, ensuring efficient storage and transmission of scientific data without compromising topological integrity.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2025 IEEE 41st International Conference on Data Engineering, ICDE 2025 |
| Pages | 3682-3695 |
| Number of pages | 14 |
| ISBN (Electronic) | 9798331536039 |
| DOIs | |
| State | Published - 2025 |
| Event | 41st IEEE International Conference on Data Engineering, ICDE 2025 - Hong Kong, China Duration: May 19 2025 → May 23 2025 |
Publication series
| Name | Proceedings - International Conference on Data Engineering |
|---|---|
| ISSN (Print) | 1084-4627 |
| ISSN (Electronic) | 2375-0286 |
Conference
| Conference | 41st IEEE International Conference on Data Engineering, ICDE 2025 |
|---|---|
| Country/Territory | China |
| City | Hong Kong |
| Period | 5/19/25 → 5/23/25 |
Bibliographical note
Publisher Copyright:© 2025 IEEE.
Funding
This work was partially supported by grants from NSF OAC-2330367, OAC-2311756, OAC-2313122, OAC-2313123, and OAC-2313124. We would like to thank the University of Kentucky Center for Computational Sciences and Information Technology Services Research Computing for its support and use of the Lipscomb Compute Cluster, Morgan Compute Cluster, and associated research computing resources.
| Funders | Funder number |
|---|---|
| Kentucky Transportation Center, University of Kentucky | |
| National Science Foundation Arctic Social Science Program | OAC-2330367, OAC-2311756, OAC-2313124, OAC-2313123, OAC-2313122 |
Keywords
- high-performance computing
- lossy compression
- Scientific data management
- vector field topology
ASJC Scopus subject areas
- Software
- Signal Processing
- Information Systems