中文RT-NeRV:通过残差标记化重新思考视频的混合神经表示
ENRT-NeRV: Rethinking Hybrid Neural Representations for Video via Residual Tokenization
NeRV将视频表示为紧凑神经网络,实现高效压缩。混合方法通过内容自适应嵌入提升重建质量,但低比特率下细节保留不足,因浅层残差信息连续传输成本高。本文重新思考混合设计,优化残差利用。
arXiv:2403.12401v2 Announce Type: replace Abstract: Neural Representations for Videos(NeRV) have emerged as a promising paradigm for video compression by representing videos as compact neural networks with efficient decoding. Hybrid NeRV methods further improve reconstruction quality through content adaptive embeddings, but still struggle to preserve fine details at low bitrates. A key limitation is that shallow residual support in formation, although highly beneficial for reconstruction, is costly to transmit in its continuous form and is therefore underutilized. In this paper, we rethink hyb