Real-Time Video Super-Resolution with Spatio-Temporal Networks and Motion Compensation

Jose Caballero,C. Ledig,Andrew P. Aitken,Alejandro Acosta,J. Totz,Zehan Wang,Wenzhe Shi

Published 2016 in Computer Vision and Pattern Recognition

ABSTRACT

Convolutional neural networks have enabled accurate image super-resolution in real-time. However, recent attempts to benefit from temporal correlations in video super-resolution have been limited to naive or inefficient architectures. In this paper, we introduce spatio-temporal sub-pixel convolution networks that effectively exploit temporal redundancies and improve reconstruction accuracy while maintaining real-time speed. Specifically, we discuss the use of early fusion, slow fusion and 3D convolutions for the joint processing of multiple consecutive video frames. We also propose a novel joint motion compensation and video super-resolution algorithm that is orders of magnitude more efficient than competing methods, relying on a fast multi-resolution spatial transformer module that is end-to-end trainable. These contributions provide both higher accuracy and temporally more consistent videos, which we confirm qualitatively and quantitatively. Relative to single-frame models, spatio-temporal networks can either reduce the computational cost by 30% whilst maintaining the same quality or provide a 0.2dB gain for a similar computational cost. Results on publicly available datasets demonstrate that the proposed algorithms surpass current state-of-the-art performance in both accuracy and efficiency.

PUBLICATION RECORD

Publication year
2016
Venue
Computer Vision and Pattern Recognition
Publication date
2016-11-16
Fields of study
Computer Science, Engineering
Identifiers
DOI 10.1109/CVPR.2017.304 arXiv 1611.05250
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Perceptual Losses for Real-Time Style Transfer and Super-Resolution
2016cited by this paper
gvnn: Neural Network Library for Geometric Computer Vision
2016cited by this paper
Accelerating the Super-Resolution Convolutional Neural Network
2016cited by this paper
Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network
2016cited by this paper
FAST: Free Adaptive Super-Resolution via Transfer for Compressed Videos
2016cited by this paper
Identity Mappings in Deep Residual Networks
2016cited by this paper
Generating Images with Perceptual Similarity Metrics based on Deep Networks
2016cited by this paper
DeepWarp: Photorealistic Image Resynthesis for Gaze Manipulation
2016cited by this paper
Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network
2016influential reference
Unsupervised convolutional neural networks for motion estimation
2016cited by this paper
Video Super-Resolution With Convolutional Neural Networks
2016influential reference
Learning-based view synthesis for light field cameras
2016cited by this paper
International Conference on Learning Representations (ICLR)
2016cited by this paper
Bidirectional Recurrent Convolutional Networks for Multi-Frame Super-Resolution
2015cited by this paper
Super-Resolution with Deep Convolutional Sufficient Statistics
2015cited by this paper
FlowNet: Learning Optical Flow with Convolutional Networks
2015cited by this paper
Deep Residual Learning for Image Recognition
2015cited by this paper
Deeply-Recursive Convolutional Network for Image Super-Resolution
2015cited by this paper
Spatio-temporal video autoencoder with differentiable memory
2015cited by this paper
Compression Artifacts Reduction by a Deep Convolutional Network
2015cited by this paper
Spatial Transformer Networks
2015cited by this paper
Dictionary-based multiple frame video super-resolution
2015cited by this paper
Image Super-Resolution Using Deep Convolutional Networks
2014influential reference
Large-Scale Video Classification with Convolutional Neural Networks
2014influential reference
Learning Spatiotemporal Features with 3D Convolutional Networks
2014influential reference
Adam: A Method for Stochastic Optimization
2014cited by this paper
Learning a Deep Convolutional Network for Image Super-Resolution
2014cited by this paper
Cardiac Image Super-Resolution with Global Correspondence Using Multi-Atlas PatchMatch
2013cited by this paper
Exact solutions to the nonlinear dynamics of learning in deep linear neural networks
2013cited by this paper
No-Reference Image Quality Assessment in the Spatial Domain
2012cited by this paper
Model recommendation for action recognition
2012cited by this paper
Coupled Dictionary Training for Image Super-Resolution
2012cited by this paper
A Bayesian approach to adaptive video super resolution
2011cited by this paper
Discrete Wavelet Transform-Based Satellite Image Resolution Enhancement
2011cited by this paper
Space-time super-resolution from a single video
2011cited by this paper
The Consumer Digital Video Library
2010cited by this paper
Motion Tuned Spatio-Temporal Quality Assessment of Natural Videos
2010cited by this paper
Image Super-Resolution Via Sparse Representation
2010cited by this paper
A Survey on Transfer Learning
2010cited by this paper
On Single Image Scale-Up Using Sparse-Representations
2010cited by this paper
Nonparametric scene parsing: Label transfer via dense scene alignment
2009cited by this paper
Super-Resolution Without Explicit Subpixel Motion Estimation
2009cited by this paper
Super-resolution from a single image
2009cited by this paper
Example-Based Learning for Single-Image Super-Resolution
2008cited by this paper
Advances in Neural Information Processing Systems (NIPS)
2007cited by this paper
Kernel Regression for Image Processing and Reconstruction
2007cited by this paper
Learning a similarity metric discriminatively, with application to face verification
2005cited by this paper
High Accuracy Optical Flow Estimation Based on a Theory for Warping
2004cited by this paper
Image quality assessment: from error visibility to structural similarity
2004cited by this paper
Super-resolution image reconstruction: a technical overview
2003cited by this paper
Two-Frame Motion Estimation Based on Polynomial Expansion
2003cited by this paper
Eigenface-domain super-resolution for face recognition
2003cited by this paper
Constrained K-means Clustering with Background Knowledge
2001cited by this paper
Digital Video Library.
2000cited by this paper
Ieee Transactions on Pattern Analysis and Machine Intelligence 1 80 Million Tiny Images: a Large Dataset for Non-parametric Object and Scene Recognition
year unknowncited by this paper

CITED BY

Controllable Reference-Guided Diffusion With Local–Global Fusion for Real-World Remote Sensing Super Resolution
2026cites this paper
LIF-VSR: A Lightweight Framework for Video Super-Resolution with Implicit Alignment and Attentional Fusion
2026cites this paper
VEGAN: CCTV video quality enhancement with GAN-based foreground separation and super-resolution
2026cites this paper
EBRNet: Lightweight Enhanced Bidirectional Recurrent Network for Satellite Video Super-Resolution
2026cites this paper
Video super-resolution for real-time rendering: decoupled G-buffer guidance with attention-enhanced modulation
2026cites this paper
NSBRNet: Non-Local Spatio-Temporal Bidirectional Recurrent Network for Satellite Video Super-Resolution
2026cites this paper
Multi-frame remote sensing super-resolution method based on dual-stream feature enhancement
2025cites this paper
Camera Sensor Raw Data-Driven Video Blur Effect Prevention: Dataset and Study
2025cites this paper
Vivid-VR: Distilling Concepts from Text-to-Video Diffusion Transformer for Photorealistic Video Restoration
2025cites this paper
Supervised multi-frame dual-channel denoising enables long-term single-molecule FRET under extremely low photon budget
2025cites this paper
VMG: Rethinking U-Net Architecture for Video Super-Resolution
2025cites this paper
Resampling video super-resolution based on multi-scale guided optical flow
2025cites this paper
Super-Resolving Dynamic Scenes With Spike Camera via Multi-Frame Sequential Alignment With Motion Propagation
2025cites this paper
Enhancing Video Quality Using a Multi-Domain Spatio-Temporal Deformable Fusion Approach
2025cites this paper
Physics-Informed Video Flare Synthesis and Removal Leveraging Motion Independence between Flare and Scene
2025cites this paper
BF-STVSR: B-Splines and Fourier—Best Friends for High Fidelity Spatial-Temporal Video Super-Resolution
2025cites this paper
Efficient Trajectory Space-Time Super-Resolution for Fast Live-cell Imaging
2025cites this paper
Taming High-Resolution Auxiliary G-Buffers for Deep Supersampling of Rendered Content
2025cites this paper
Detail Enhanced Gaussian Splatting for Large-Scale Volumetric Capture
2025cites this paper
Secure AI-Driven Super-Resolution for Real-Time Mixed Reality Applications
2025cites this paper
Bidirectional spatio-temporal generative adversarial network for video super-resolution
2025cites this paper
STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution
2025cites this paper
Small Clips, Big Gains: Learning Long-Range Refocused Temporal Information for Video Super-Resolution
2025cites this paper
STCDiT: Spatio-Temporally Consistent Diffusion Transformer for High-Quality Video Super-Resolution
2025cites this paper
Low Resource Video Super-resolution using Memory and Residual Deformable Convolutions
2025cites this paper
Implicit Neural Representation for Video and Image Super-Resolution
2025cites this paper
Application of Video Super-Resolution Reconstruction Algorithm for Luojia3-01 Satellite
2025cites this paper
A Deformable Convolutional Neural Network for Video Super‐Resolution
2025cites this paper
Deep-Learning-Empowered Super Resolution: A Comprehensive Survey and Future Prospects
2025cites this paper
Omnidirectional Video Super-Resolution Using Deep Learning
2025cites this paper
BasicAVSR: Arbitrary-Scale Video Super-Resolution via Image Priors and Enhanced Motion Compensation
2025cites this paper
Efficient Video Super-Resolution for Real-time Rendering with Decoupled G-buffer Guidance
2025cites this paper
Rethinking Compressive Sensing: A Compression Framework for Video Super-Resolution
2025cites this paper
DiTVR: Zero-Shot Diffusion Transformer for Video Restoration
2025cites this paper
Multitemporal Difference and Dynamic Optimization Framework for Multiscale Motion Satellite Video Super-Resolution
2025cites this paper
3D Enhanced Residual CNN for Video Super-Resolution Network
2025cites this paper
Gather-Scatter Mamba: Accelerating Propagation with Efficient State Space Model
2025cites this paper
Deep Learning for Regular Raster Spatio-Temporal Prediction: An Overview
2025cites this paper
A Survey on Intelligent Solutions for Increased Video Delivery Quality in Cloud–Edge–End Networks
2025cites this paper
4D-STDF: Compressed Video Quality Enhancement with 3D Spatio-Temporal Fusion and Deformable Convolution
2025cites this paper
Fusing Multi-Temporal Context for Image Super-Resolution Reconstruction in Cultural Heritage Monitoring
2025cites this paper
A Survey of Deep-Learning-Based Compressed Video Quality Enhancement
2025cites this paper
Speech-aided facial video super resolution with accurate lip motion and enhanced frequency details
2025cites this paper
RenderGAN: Enhancing Real-time Rendering Efficiency with Deep Learning
2025cites this paper
PMQ-VE: Progressive Multi-Frame Quantization for Video Enhancement
2025cites this paper
Joint Video Enhancement with Deblurring, Super-Resolution, and Frame Interpolation Network
2025cites this paper
Multi-Axis Feature Diversity Enhancement for Remote Sensing Video Super-Resolution
2025cites this paper
Improving bi-directional recurrent network for video super-resolution with deformable motion alignment structure
2025cites this paper
Dual Bidirectional Feature Enhancement Network for Continuous Space-Time Video Super-Resolution
2025cites this paper
MHAVSR: A multi-layer hybrid alignment network for video super-resolution
2025cites this paper
Spk2SRImgNet: Super-Resolve Dynamic Scene from Spike Stream via Motion Aligned Collaborative Filtering
2025cites this paper
Spatial Degradation-Aware and Temporal Consistent Diffusion Model for Compressed Video Super-Resolution
2025cites this paper
HAMSA: Hybrid attention transformer and multi-scale alignment aggregation network for video super-resolution
2025cites this paper
High-speed image enhancement: Real-time super-resolution and artifact removal for degraded analog footage
2025cites this paper
Low-Resource Video Super-Resolution using Memory, Wavelets, and Deformable Convolutions
2025cites this paper
REPVSR: Efficient Video Super-Resolution via Structural Re-Parameterization
2025influential citation
Video super resolution based on deformable 3D convolutional group fusion
2025influential citation
A Spatio-Temporal Recurrent Alignment for Video Super-Resolution
2025cites this paper
RepCaM++: Exploring Transparent Visual Prompt With Inference-Time Re-Parameterization for Neural Video Delivery
2025cites this paper
Video Quality Enhancement Using Multi-Domain Spatio-Temporal Deformable Fusion
2025cites this paper
Holistic review of super resolution algorithms
2025cites this paper
FDI-VSR: Video Super-Resolution Through Frequency-Domain Integration and Dynamic Offset Estimation
2025influential citation
Lightweight Video Super-Resolution Network Based on Pyramid Optical Flow Extraction and Alignment
2025cites this paper
GRNN:Recurrent Neural Network based on Ghost Features for Video Super-Resolution
2025cites this paper
JFFRA : Joint Flow and Feature Refinement Using Attention for Video Restoration
2025cites this paper
Infrared Tiny Structureless Object Detection Enhanced by Video Super-Resolution
2025cites this paper
FCA2: Frame Compression-Aware Autoencoder for Modular and Fast Compressed Video Super-Resolution
2025cites this paper
Controllable Reference Guided Diffusion with Local Global Fusion for Real World Remote Sensing Image Super Resolution
2025cites this paper
CVSR: complex-valued networks for video super-resolution
2025cites this paper
Event-based Video Super-Resolution via State Space Models
2025cites this paper
EOS: Energy-Optimized Super-Resolution on Mobile Devices for Live 360-Degree Videos
2025cites this paper
Fine-grained video super-resolution via spatial-temporal learning and image detail enhancement
2024cites this paper
FMA-Net: Flow-Guided Dynamic Filtering and Iterative Feature Refinement with Multi-Attention for Joint Video Super-Resolution and Deblurring
2024cites this paper
CTVSR: Collaborative Spatial–Temporal Transformer for Video Super-Resolution
2024influential citation
FDDCC-VSR: a lightweight video super-resolution network based on deformable 3D convolution and cheap convolution
2024influential citation
Omniscient Video Super-Resolution with Explicit-Implicit Alignment
2024influential citation
AverNet: All-in-one Video Restoration for Time-varying Unknown Degradations
2024cites this paper
Joint Delay-Sensitive and Power-Efficient Quality Control of Dynamic Video Streaming Using Adaptive Super-Resolution
2024cites this paper
Video Super-Resolution Method Using Multi-Frame Attention and Gradual Fusion
2024cites this paper
DVSRNet: Deep Video Super-Resolution Based on Progressive Deformable Alignment and Temporal-Sparse Enhancement
2024influential citation
Edge and Texture Enhanced Reference based Super-Resolution Network for Remote Sensing Images
2024influential citation
Low-light Video Enhancement with Conditional Diffusion Models and Wavelet Interscale Attentions
2024cites this paper
Target-specified reference-based deep learning network for joint image deblurring and resolution enhancement in surgical zoom lens camera calibration
2024cites this paper
A compressed video quality enhancement algorithm based on CNN and transformer hybrid network
2024cites this paper
Super-Resolution Reconstruction from Bayer-Pattern Spike Streams
2024cites this paper
Research on Video Frame Insertion Method of Complex
2024cites this paper
Multi-Stage Spatio-Temporal Fusion Network for Fast and Accurate Video Bit-Depth Enhancement
2024cites this paper
Satellite Video Super-Resolution via Unidirectional Recurrent Network and Various Degradation Modeling
2024cites this paper
Unsupervised Video Face Super-Resolution via Untrained Neural Network Priors
2024cites this paper
Learning Large-Factor EM Image Super-Resolution with Generative Priors
2024cites this paper
Spatio-Temporal Distortion Aware Omnidirectional Video Super-Resolution
2024cites this paper
A lightweight distillation recurrent convolution network on FPGA for real-time video super-resolution
2024cites this paper
SkipVSR: Adaptive Patch Routing for Video Super-Resolution with Inter-Frame Mask
2024cites this paper
Blind Face Video Restoration with Temporal Consistent Generative Prior and Degradation-Aware Prompt
2024cites this paper
Learning Truncated Causal History Model for Video Restoration
2024cites this paper
OnlyFlow: Optical Flow Based Motion Conditioning for Video Diffusion Models
2024cites this paper
ViChaser: Chase Your Viewpoint for Live Video Streaming With Block-Oriented Super-Resolution
2024cites this paper
Deep Blind Super-Resolution for Satellite Video
2024cites this paper
Video Frame Interpolation Using Real-Time Intermediate Flow Estimation
2024cites this paper
A 'deep' review of video super-resolution
2024cites this paper