Res-NERV : Residual blocks for a practical implicit neural video decoder
Marwa Tarchouli  1, 2@  , Thomas Guionnet  1@  , Marc Riviere  1@  , Wassim Hamidouche  2@  , Meriem Outtas  2@  , Olivier Deforges  3@  
1 : ATEME [Rennes]
ATEME [Rennes]
2 : IETR
INSA Rennes
3 : Institut dÉlectronique et de Télécommunications de Rennes  (IETR)
Université de Nantes, Universite de Rennes 1, Institut National des Sciences Appliquées - Rennes, CentraleSupélec, Centre National de la Recherche Scientifique : UMR6164
Campus de Beaulieu Bâtiment 11D263 Av.Général Leclerc-CS 74205 35042 Rennes Cedex -  France

This paper proposes the integration of residual blocks into neural representation for videos (NeRV)-based architectures with the aim of enhancing the reconstruction of detailed patterns and high-level features. Additionally, a coding pipeline is introduced, placing the implicit neural decoder in a real-life video streaming framework. Indeed, DeepCABAC is employed for model compression, applying a quantization scheme followed by the context-adaptive binary arithmetic coding (CABAC) entropy coding algorithm, ultimately leading to bitstream generation. Our method outperforms NeRV, as well as x264 and x265, achieving BD-rate gains against NeRV: -12.06% using PSNR and -14.25% using MS-SSIM. Furthermore, it exhibits superior subjective quality compared to NeRV, attributed to enhanced high-level feature reconstruction. This observed behavior encourages the application of our method to other NeRV-based models, such as E-NeRV. This paper has been accepted at the 2024 IEEE International Conference on Image Processing.


Personnes connectées : 3 Vie privée
Chargement...