Stable Virtual Camera: Multi-View Video Generation with 3D Camera Control

Mar 18

We present Stable Virtual Camera, a generalist diffusion model that creates novel views of a scene, given any number of input views and target cameras. Existing works struggle to generate either large viewpoint changes or temporally smooth samples, while relying on specific task configurations. Our approach overcomes these limitations through simple model design, optimized training recipe, and flexible sampling strategy that generalize across view synthesis tasks at test time. As a result, our samples maintain high consistency without requiring additional 3D representation-based distillation, thus streamlining view synthesis in the wild. Furthermore, we show that our method can generate high-quality videos lasting up to half a minute with seamless loop closure. Extensive benchmarking demonstrates that Stable Virtual Camera outperforms existing methods across different datasets and settings.

Read the paper

Savannah Martin

Stable Virtual Camera: Multi-View Video Generation with 3D Camera Control

Company

Models

Deployment

ResourceS

Contact Us

Legal

Applications

Join the Mailing List

Stable Virtual Camera: Multi-View Video Generation with 3D Camera Control

Fast High-Resolution Image Synthesis with Latent Adversarial Diffusion Distillation

SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images

Company

Models

Deployment

ResourceS

Contact Us

Legal

Applications

Join the Mailing List