Deep Learning-Based Point Cloud Coding and Super-Resolution: a Joint Geometry and Color Approach
Guarda, A.
; Ruivo, M.
; Coelho, L.
;
Seleem, A.
;
Rodrigues, Nuno M. M.
;
Pereira, F.
IEEE Transactions on Multimedia Vol. , Nº , pp. 1 - 13, , 2024.
ISSN (print): 1520-9210
ISSN (online): 1941-0077
Scimago Journal Ranking: 2,26 (in 2023)
Digital Object Identifier: 10.1109/TMM.2023.3338081
Abstract
In this golden age of multimedia, realistic content is in high demand with users seeking more immersive and interactive experiences. As a result, new image modalities for 3D representations have emerged in recent years, among which point clouds have deserved especial attention. Naturally, with this increase in demand, efficient storage and transmission became a must, with standardization groups such as MPEG and JPEG entering the scene, as it happened before with other types of visual media. In a surprising development, JPEG issued a Call for Proposals on point cloud coding targeting exclusively learning-based solutions, in parallel to a similar call for image coding. This is a natural consequence of the growing popularity of deep learning, which due to its excellent performances is currently dominant in the multimedia processing field, including coding. This paper presents the coding solution selected by JPEG as the best-performing response to the Call for Proposals and adopted as the first version of the JPEG Pleno Point Cloud Coding Verification Model, in practice the first step for developing a standard. The proposed solution offers a novel joint geometry and color approach for point cloud coding, in which a single deep learning model processes both geometry and color simultaneously. To maximize the RD performance for a large range of point clouds, the proposed solution uses down-sampling and learning-based super-resolution as pre- and post-processing steps. Compared to the MPEG point cloud coding standards, the proposed coding solution comfortably outperforms G-PCC, for both geometry, color, and joint quality metrics.