EnVision-Research / DualCamCtrl
Official Implementation of Paper [DualCamCtrl: Dual-Branch Diffusion Model for Geometry-Aware Camera-Controlled Video Generation]
☆74 Updated last month
Alternatives and similar repositories for DualCamCtrl
Users who are interested in DualCamCtrl are comparing it to the repositories listed below.
- G2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning ☆258 Updated 3 weeks ago
- This is the repository that contains source code for the PhysGen3D. ☆240 Updated 4 months ago
- [ICLR 2026] NewtonGen: Physics-Consistent and Controllable Text-to-Video Generation via Neural Newtonian Dynamics ☆121 Updated 2 weeks ago
- Are Video Models Ready as Zero-shot Reasoners? ☆84 Updated 2 months ago
- [CVPR2024] Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusion ☆137 Updated last year
- [NeurIPS 2025 Spotlight] Towards Understanding Camera Motions in Any Video ☆269 Updated 2 months ago
- ☆140 Updated 10 months ago
- [NeurIPS 25] TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixels ☆180 Updated last month
- 4DNeX: Feed-Forward 4D Generative Modeling Made Easy ☆819 Updated last month
- [ICRA 2026] A Unified Driving World Model for Future Generation and Perception ☆136 Updated last week
- [ICLR 2026] Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation ☆378 Updated 2 weeks ago
- [ICCV 2025 ⭐highlight⭐] Implementation of VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory ☆414 Updated 6 months ago
- [SIGGRAPH Conference 2024] GaussianPrediction: Dynamic 3D Gaussian Prediction for Motion Extrapolation and Free View Synthesis ☆158 Updated 10 months ago
- [AAAI 2026 🔥] Official implementation of "NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representation" ☆176 Updated 5 months ago
- Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation ☆272 Updated last month
- This project is the official implementation of "UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Gener… ☆206 Updated last week
- Official Implementation of "UniFlow: A Unified Pixel Flow Tokenizer for Visual Understanding and Generation" ☆137 Updated 3 months ago
- SAM 3D Objects with Multi-view Images ☆201 Updated 2 months ago
- DynamicTree: Interactive Real Tree Animation via Sparse Voxel Spectrum ☆29 Updated 2 months ago
- [NeurIPS'25] Official repository of Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations ☆498 Updated 2 months ago
- [ECCV2024] DreamScene: 3D Gaussian-based Text-to-3D Scene Generation via Formation Pattern Sampling ☆228 Updated 2 months ago
- [NeurIPS 2025 Spotlight] A Native Multimodal LLM for 3D Generation and Understanding ☆539 Updated 3 months ago
- Official implementation of "Next-Scale Autoregressive Models are Zero-Shot Single-Image Object View Synthesizers" ☆45 Updated 10 months ago
- [Official] AstraNav-Memory: Contexts Compression for Long Memory. An image-centric memory framework for lifelong embodied navigation via … ☆29 Updated 3 weeks ago
- OmniNWM: Omniscient Navigation World Models for Autonomous Driving ☆303 Updated this week
- [SIGGRAPH 2025] Officially implement of the paper "Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussi… ☆156 Updated 9 months ago
- ☆38 Updated 3 months ago
- OneCAT: Decoder-Only Auto-Regressive Model for Unified Understanding and Generation ☆255 Updated 4 months ago
- Official implementation of "ReCamDriving: LiDAR-Free Camera-Controlled Novel Trajectory Video Generation" ☆88 Updated last month
- [ICML 2025 Oral] An official implementation of VideoRoPE & VideoRoPE++ ☆216 Updated last week