Abstract
Video-based point cloud compression (V-PCC) is a state-of-the-art moving picture experts group (MPEG) standard for point cloud compression. V-PCC can be used to compress both static and dynamic point clouds in a lossless, near lossless, or lossy way. Many objective quality metrics have been proposed for distorted point clouds. Most of these metrics are full-reference metrics that require both the original point cloud and the distorted one. However, in some real-time applications, the original point cloud is not available, and no-reference or reduced-reference quality metrics are needed. Three main challenges in the design of a reduced-reference quality metric are how to build a set of features that characterize the visual quality of the distorted point cloud, how to select the most effective features from this set, and how to map the selected features to a perceptual quality score. We address the first challenge by proposing a comprehensive set of features consisting of compression, geometry, normal, curvature, and luminance features. To deal with the second challenge, we use the least absolute shrinkage and selection operator (LASSO) method, which is a variable selection method for regression problems. Finally, we map the selected features to the mean opinion score in a nonlinear space. Although we have used only 19 features in our current implementation, our metric is flexible enough to allow any number of features, including future more effective ones. Experimental results on the Waterloo point cloud dataset version 2 (WPC2.0) and the MPEG point cloud compression dataset (M-PCCD) show that our method, namely PCQAML, outperforms state-of-the-art full-reference and reduced-reference quality metrics in terms of Pearson linear correlation coefficient, Spearman rank order correlation coefficient, Kendall's rank-order correlation coefficient, and root mean squared error.
| Original language | English |
|---|---|
| Article number | 10375131 |
| Pages (from-to) | 6238-6249 |
| Number of pages | 12 |
| Journal | IEEE Transactions on Multimedia |
| Volume | 26 |
| DOIs | |
| State | Published - 2024 |
Bibliographical note
Publisher Copyright:© 2023 IEEE.
Funding
This work was supported in part by the National Science Foundation of China under Grants 62222110, 62172259, and 62311530104, in part by the National Science Foundation under Grant IIS 2327113, in part by the National Institutes of Health under Grants R21AG070909, P30AG072946, and R01HD101508-01, in part by the High-end Foreign Experts Recruitment Plan of Chinese Ministry of Science and Technology under Grant G2023150003L, in part by the Taishan Scholar Project of Shandong Province under Grant tsqn202103001, and in part by the Shandong Provincial Natural Science Foundation, China, under Grants ZR2022MF275, ZR2022QF076, ZR2022ZD038, ZR2021MF025, and ZR2022ZD38.
| Funders | Funder number |
|---|---|
| Natural Science Foundation of Shandong Province | ZR2021MF025, ZR2022ZD038, ZR2022ZD38, ZR2022QF076, ZR2022MF275 |
| National Institutes of Health (NIH) | R01HD101508-01, P30AG072946, R21AG070909 |
| Taishan Scholar Project of Shandong Province | tsqn202103001 |
| National Science Foundation Arctic Social Science Program | IIS 2327113 |
| National Natural Science Foundation of China (NSFC) | 62311530104, 62172259, 62222110 |
| Ministry of Science and Technology of the People's Republic of China | G2023150003L |
Keywords
- LASSO regression
- Point cloud compression
- feature selection
- perceptual quality metric
- support vector regression
ASJC Scopus subject areas
- Signal Processing
- Media Technology
- Computer Science Applications
- Electrical and Electronic Engineering