VLG | Computer Vision and Learning Group

Authors: Yiming Wang, Qihang Zhang, Shengqu Cai, Tong Wu, Jan Ackermann, Zhengfei Kuang, Yang Zheng, Frano Rajič, Siyu Tang, Gordon Wetzstein

Abstract

Emerging video diffusion models achieve high visual fidelity but fundamentally couple scene dynamics with camera motion, which limits their ability to provide precise spatial and temporal control. BulletTime is a 4D-controllable video diffusion framework that explicitly decouples scene dynamics from camera pose, enabling fine-grained, independent manipulation of both.

Authors:

Yiming Wang
Direct doctorate student, CNB G 100.5

Prof. Dr. Siyu Tang
Assistant Professor of Computer Science, CNB G 104

BulletTime: Decoupled Control of Time and Camera Pose for Video Generation

Conference: Conference on Computer Vision and Pattern Recognition (CVPR 2026)
