GANs trained on a single video can perform a variety of video generation and manipulation tasks. However, these single-video GANs require an unreasonable amount of training time per video, rendering them almost impractical. In this paper we question the necessity of a GAN for generation from a single video, and introduce a non-parametric baseline for a variety of generation and manipulation tasks. Inspired by Granot et al. (2021), we revive classical space-time patch nearest-neighbor approaches and adapt them to a scalable, unconditional generative model, without any learning. This simple baseline surprisingly outperforms single-video GANs in visual quality and realism (confirmed by quantitative and qualitative evaluations), and is disproportionately faster (runtime reduced from several days to seconds). Beyond diverse video generation, we demonstrate other applications of the same framework, including video analogies and spatio-temporal retargeting. Our approach scales easily to Full-HD videos. These observations show that classical approaches, if adapted correctly, significantly outperform heavy deep-learning machinery for these tasks. This sets a new baseline for single-video generation and manipulation tasks, and, no less important, makes diverse generation from a single video practically possible for the first time.
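At its core, the generation process repeatedly replaces space-time patches of a candidate output with their nearest neighbors from the input video, so that every local space-time patch of the result also appears somewhere in the input. Below is a minimal, single-scale NumPy sketch of that projection idea; the actual method works coarse-to-fine over a spatio-temporal pyramid with a much faster (approximate) nearest-neighbor search, and the names `extract_patches` and `patch_nn_project` are illustrative, not the released API.

```python
# Minimal single-scale sketch of space-time patch nearest-neighbor generation.
# Brute-force and slow on purpose (for illustration only); the real method runs
# coarse-to-fine with an efficient approximate nearest-neighbor search.
import numpy as np

def extract_patches(video, p=5, t=3):
    """Collect all overlapping t x p x p space-time patches as flat vectors."""
    T, H, W, C = video.shape
    patches = []
    for ti in range(T - t + 1):
        for yi in range(H - p + 1):
            for xi in range(W - p + 1):
                patches.append(video[ti:ti + t, yi:yi + p, xi:xi + p].ravel())
    return np.stack(patches)

def patch_nn_project(query_video, key_video, p=5, t=3):
    """Replace each patch of `query_video` by its nearest (L2) patch from
    `key_video`, then average the overlapping contributions."""
    keys = extract_patches(key_video, p, t)
    T, H, W, C = query_video.shape
    out = np.zeros(query_video.shape, dtype=np.float64)
    weight = np.zeros((T, H, W, 1))
    for ti in range(T - t + 1):
        for yi in range(H - p + 1):
            for xi in range(W - p + 1):
                q = query_video[ti:ti + t, yi:yi + p, xi:xi + p].ravel()
                nn = keys[np.argmin(((keys - q) ** 2).sum(axis=1))]
                out[ti:ti + t, yi:yi + p, xi:xi + p] += nn.reshape(t, p, p, C)
                weight[ti:ti + t, yi:yi + p, xi:xi + p] += 1.0
    return out / weight

# Diverse generation: perturb the input with noise (conceptually, at the coarsest
# pyramid level), then repeatedly project it back onto the input's own patch set.
rng = np.random.default_rng(0)
video = rng.random((6, 24, 24, 3))                 # stand-in for a (downscaled) input video
sample = video + 0.75 * rng.standard_normal(video.shape)
for _ in range(3):
    sample = patch_nn_project(sample, video)
```

Since no parameters are trained, the whole procedure reduces to nearest-neighbor search and averaging, which is what enables the runtimes reported in the comparisons below.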
Analogies between all pairs of the 4 input videos, which appear on the diagonal (marked in red).
Each generated video is the combination of an input video from its row (layout) and an input video from its column (appearance).
When the spatio-temporal layout is taken from a sketch we call it Sketch-to-Video:
Below are results for the same "style" video (appearance & dynamics) and different spatio-temporal layouts (morphed MNIST digits):
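The analogies and sketch-to-video results above follow the same recipe; only the source of the query and key patches changes. Here is a hedged sketch, reusing `patch_nn_project` from the snippet above (again illustrative, not the released code; `layout_video` and `appearance_video` are stand-in arrays):

```python
# Video analogies (illustrative): keep the spatio-temporal layout of one video while
# drawing every output patch from another. Reuses patch_nn_project from the sketch above.
import numpy as np
rng = np.random.default_rng(1)
layout_video = rng.random((6, 24, 24, 3))       # stand-in: video supplying the layout
appearance_video = rng.random((6, 24, 24, 3))   # stand-in: video supplying appearance/dynamics

analogy = patch_nn_project(layout_video, appearance_video)   # initialize from the layout
for _ in range(2):                                           # a few refinement passes
    analogy = patch_nn_project(analogy, appearance_video)    # patches come only from "appearance"
```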
Please click here for Ablations and comparisons
Please watch in full-screen by pressing the icon (bottom-right in each video).
Note the differences in quality (both resolution and artifacts) and in the time it took to generate each sample.
(All videos used in Table 1 are here)
Input Video (1280x1920) | HP-VAE-GAN (144x256), 8 days training | Ours (1280x1920), 9 mins per video | Ours (144x256), 18 secs per video
Input Video (1280x1920) | HP-VAE-GAN (144x256), 8 days training | Ours (1280x1920), 9 mins per video | Ours (144x256), 18 secs per video
Input Video (720x1280) | HP-VAE-GAN (144x256), 8 days training | Ours (720x1280), 6 mins per video | Ours (144x256), 18 secs per video
Input Video (1280x1920) | HP-VAE-GAN (144x256), 8 days training | Ours (1280x1920), 9 mins per video | Ours (144x256), 18 secs per video
The input video is at the top-left.
The rest are retargeted to different spatial shapes.
Top: Input video (4 examples)
Bottom: A summary of the input video (note that the video is shorter)
Watch the top video first, then the bottom. Note, for example, how in the top video the trainer and the dog rotate sequentially, whereas in the bottom they rotate simultaneously.
Top: Input video (3 examples)
Bottom: An extension of the input video (note that the video is longer)
Watch the top video first, then the bottom. Note, for example, how in the ballet video the choreography is longer while the pace of the motions remains the same.
5 examples total
Top: Original video
Middle: Input video (algorithm only sees this)
Bottom: Output
Note how a blue cue is replaced by a player from Barcelona and a white cue by a player from Real Madrid
Top-left: Input video. The rest are randomly generated.
Inputs with significant depth variation and large camera motion suffer from non-rigid deformations.
@inproceedings{haim2022diverse,
title={Diverse generation from a single video made possible},
author={Haim, Niv and Feinstein, Ben and Granot, Niv and Shocher, Assaf and Bagon, Shai and Dekel, Tali and Irani, Michal},
booktitle={European Conference on Computer Vision},
pages={491--509},
year={2022},
organization={Springer}
}