COmpression et REprésentation des Signaux Audiovisuels

sciencesconf.org:coresa2024:569140

Res-NERV : Residual blocks for a practical implicit neural video decoder

Marwa Tarchouli 1, 2, @ , Thomas Guionnet 1, @ , Marc Riviere 1, @ , Wassim Hamidouche 2, @ , Meriem Outtas 2, @ , Olivier Deforges 3, @

1 : ATEME [Rennes]

ATEME [Rennes]

2 : IETR

INSA Rennes

3 : Institut dÉlectronique et de Télécommunications de Rennes (IETR)

Université de Nantes, Universite de Rennes 1, Institut National des Sciences Appliquées - Rennes, CentraleSupélec, Centre National de la Recherche Scientifique : UMR6164

Campus de Beaulieu Bâtiment 11D263 Av.Général Leclerc-CS 74205 35042 Rennes Cedex - France

This paper proposes the integration of residual blocks into neural representation for videos (NeRV)-based architectures with the aim of enhancing the reconstruction of detailed patterns and high-level features. Additionally, a coding pipeline is introduced, placing the implicit neural decoder in a real-life video streaming framework. Indeed, DeepCABAC is employed for model compression, applying a quantization scheme followed by the context-adaptive binary arithmetic coding (CABAC) entropy coding algorithm, ultimately leading to bitstream generation. Our method outperforms NeRV, as well as x264 and x265, achieving BD-rate gains against NeRV: -12.06% using PSNR and -14.25% using MS-SSIM. Furthermore, it exhibits superior subjective quality compared to NeRV, attributed to enhanced high-level feature reconstruction. This observed behavior encourages the application of our method to other NeRV-based models, such as E-NeRV. This paper has been accepted at the 2024 IEEE International Conference on Image Processing.

Type :	:	Résumé (2 pages)
Thématiques	:	Poster (articles résumés)

Vie privée | Accessibilité