Ir directamente a la navegación principal Ir directamente a la búsqueda Ir directamente al contenido principal

Optimizing Lossy Compression with Adjacent Snapshots for N-body Simulation Data

  • Sihuan Li
  • , Sheng Di
  • , Xin Liang
  • , Zizhong Chen
  • , Franck Cappello

Producción científica: Conference contributionrevisión exhaustiva

17 Citas (Scopus)

Resumen

Today's N-body simulations are producing extremely large amounts of data. The Hardware/Hybrid Accelerated Cosmology Code (HACC), for example, may simulate trillions of particles, producing tens of petabytes of data to store in a parallel file system, according to the HACC users. In this paper, we design and implement an efficient, in situ error-bounded lossy compressor to significantly reduce the data size for N-body simulations. Not only can our compressor save significant storage space for N-body simulation researchers, but it can also improve the I/O performance considerably with limited memory and computation overhead. Our contribution is threefold. (1) We propose an efficient data compression model by leveraging the consecutiveness of the cosmological data in both space and time dimensions as well as the physical correlation across different fields. (2) We propose a lightweight, efficient alignment mechanism to align the disordered particles across adjacent snapshots in the simulation, which is a fundamental step in the whole compression procedure. We also optimize the compression quality by exploring best-fit data prediction strategies and optimizing the frequencies of the space-based compression vs. time-based compression. (3) We evaluate our compressor using both a cosmological simulation package and molecular dynamics simulation data - two major categories in the N-body simulation domain. Experiments show that under the same distortion of data, our solution produces up to 43% higher compression ratios on the velocity field and up to 300% higher on the position field than do other state-of-the-art compressors (including SZ, ZFP, NUMARCK, and decimation). With our compressor, the overall I/O time on HACC data is reduced by up to 20% compared with the second-best compressor.

Idioma originalEnglish
Título de la publicación alojadaProceedings - 2018 IEEE International Conference on Big Data, Big Data 2018
EditoresNaoki Abe, Huan Liu, Calton Pu, Xiaohua Hu, Nesreen Ahmed, Mu Qiao, Yang Song, Donald Kossmann, Bing Liu, Kisung Lee, Jiliang Tang, Jingrui He, Jeffrey Saltz
Páginas428-437
Número de páginas10
ISBN (versión digital)9781538650356
DOI
EstadoPublished - jul 2 2018
Evento2018 IEEE International Conference on Big Data, Big Data 2018 - Seattle, United States
Duración: dic 10 2018dic 13 2018

Serie de la publicación

NombreProceedings - 2018 IEEE International Conference on Big Data, Big Data 2018

Conference

Conference2018 IEEE International Conference on Big Data, Big Data 2018
País/TerritorioUnited States
CiudadSeattle
Período12/10/1812/13/18

Nota bibliográfica

Publisher Copyright:
© 2018 IEEE.

Financiación

ACKNOWLEDGMENTS This research was supported by the Exascale Computing Project (ECP), Project Number: 17-SC-20-SC, a collaborative effort of two DOE organizations – the Office of Science and the National Nuclear Security Administration, responsible for the planning and preparation of a capable exascale ecosystem, including software, applications, hardware, advanced system engineering and early testbed platforms, to support the nations exascale computing imperative. The material was supported by the U.S. Department of Energy, Office of Science, under contract DE-AC02-06CH11357, and supported by the National Science Foundation under Grant No. 1619253. This research is also supported by NSF Award No. 1513201. We acknowledge the computing resources provided on Bebop, which is operated by the Laboratory Computing Resource Center at Argonne National Laboratory.

FinanciadoresNúmero del financiador
National Science Foundation (NSF)1513201, 1619253
Michigan State University-U.S. Department of Energy (MSU-DOE) Plant Research LaboratoryDE-AC02-06CH11357
Office of Science Programs
National Nuclear Security Administration

    ASJC Scopus subject areas

    • Computer Science Applications
    • Information Systems

    Huella

    Profundice en los temas de investigación de 'Optimizing Lossy Compression with Adjacent Snapshots for N-body Simulation Data'. En conjunto forman una huella única.

    Citar esto