Efficient Lossy Compression for Scientific Data Based on Pointwise Relative Error Bound

Sheng Di, Dingwen Tao, Xin Liang, Franck Cappello

Producción científica: Articlerevisión exhaustiva

19 Citas (Scopus)

Resumen

An effective data compressor is becoming increasingly critical to today's scientific research, and many lossy compressors are developed in the context of absolute error bounds. Based on physical/chemical definitions of simulation fields or multiresolution demand, however, many scientific applications need to compress the data with a pointwise relative error bound (i.e., the smaller the data value, the smaller the compression error to tolerate). To this end, we propose two optimized lossy compression strategies under a state-of-the-art three-staged compression framework (prediction + quantization + entropy-encoding). The first strategy (called block-based strategy) splits the data set into many small blocks and computes an absolute error bound for each block, so it is particularly suitable for the data with relatively high consecutiveness in space. The second strategy (called multi-threshold-based strategy) splits the whole value range into multiple groups with exponentially increasing thresholds and performs the compression in each group separately, which is particularly suitable for the data with a relatively large value range and spiky value changes. We implement the two strategies rigorously and evaluate them comprehensively by using two scientific applications which both require lossy compression with point-wise relative error bound. Experiments show that the two strategies exhibit the best compression qualities on different types of data sets respectively. The compression ratio of our lossy compressor is higher than that of other state-of-the-art compressors by 17.2-618 percent on the climate simulation data and 30-210 percent on the N-body simulation data, with the same relative error bound and without degradation of the overall visualization effect of the entire data.

Idioma originalEnglish
Número de artículo8421751
Páginas (desde-hasta)331-345
Número de páginas15
PublicaciónIEEE Transactions on Parallel and Distributed Systems
Volumen30
N.º2
DOI
EstadoPublished - feb 1 2019

Nota bibliográfica

Publisher Copyright:
© 2018 IEEE.

Financiación

This research was supported by the Exascale Computing Project (ECP), Project Number: 17-SC-20-SC, a collaborative effort of two DOE organizations - the Office of Science and the National Nuclear Security Administration, responsible for the planning and preparation of a capable exascale ecosystem, including software, applications, hardware, advanced system engineering and early testbed platforms, to support the nation's exascale computing imperative. The material was supported by the U.S. Department of Energy, Office of Science, under contract DE-AC02-06CH11357, and supported by theUSNational Science Foundation under GrantNo. 1619253. This research was supported by the Exascale Computing Project (ECP), Project Number: 17-SC-20-SC, a collaborative effort of two DOE organizations – the Office of Science and the National Nuclear Security Administration, responsible for the planning and preparation of a capable exascale ecosystem, including software, applications, hardware, advanced system engineering and early testbed platforms, to support the nation’s exascale computing imperative. The material was supported by the U.S. Department of Energy, Office of Science, under contract DE-AC02-06CH11357, and supported by the US National Science Foundation under Grant No. 1619253.

FinanciadoresNúmero del financiador
National Science Foundation Science & Technology Center
US National Science Foundation1619253
Michigan State University-U.S. Department of Energy (MSU-DOE) Plant Research LaboratoryDE-AC02-06CH11357
Office of Science Programs
National Nuclear Security Administration

    ASJC Scopus subject areas

    • Signal Processing
    • Hardware and Architecture
    • Computational Theory and Mathematics

    Huella

    Profundice en los temas de investigación de 'Efficient Lossy Compression for Scientific Data Based on Pointwise Relative Error Bound'. En conjunto forman una huella única.

    Citar esto