GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography
☆108Dec 31, 2025Updated 2 months ago
Alternatives and similar repositories for GenDoP
Users that are interested in GenDoP are comparing it to the libraries listed below
Sorting:
- RelightVid: Temporal-Consistent Diffusion Model for Video Relighting☆102Apr 2, 2025Updated 11 months ago
- (ICLR 2026)Official repository of 'ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing’☆58Jan 26, 2026Updated last month
- [ICLR 2026] Official implementation of DiCache: Let Diffusion Model Determine Its Own Cache☆55Jan 26, 2026Updated last month
- Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation☆48Dec 11, 2024Updated last year
- [NeurIPS 2025] Official implementation of HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance☆85Sep 18, 2025Updated 5 months ago
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning☆35Aug 28, 2025Updated 6 months ago
- (Siggraph Asia 2023) Project Page of "HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image"☆10Dec 9, 2023Updated 2 years ago
- [CVPR 2026] An official implementation of "Think Visually, Reason Textually: Vision-Language Synergy in ARC"☆37Nov 26, 2025Updated 3 months ago
- Official Implementation of VideoRFSplat: Direct Scene-Level Text-to-3D Gaussian Splatting Generation with Flexible Pose and Multi-View Jo…☆23Jun 27, 2025Updated 8 months ago
- Official code of "Imagine360: Immersive 360 Video Generation from Perspective Anchor"☆166May 14, 2025Updated 9 months ago
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆37Jan 21, 2025Updated last year
- [CVPR 2026] Official pytorch implementation of "ReDirector: Creating Any-Length Video Retakes with Rotary Camera Encoding"☆21Dec 17, 2025Updated 2 months ago
- Official repo for "IDArb: Intrinsic Decomposition for arbitrary number of input views and illuminations"☆95Jul 9, 2025Updated 7 months ago
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment.☆85May 4, 2025Updated 9 months ago
- [ NeurIPS 2024 D&B Track ] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"☆73Dec 27, 2024Updated last year
- [CVPR 2024] S-DyRF: Reference-Based Stylized Radiance Fields for Dynamic Scenes☆13Jun 1, 2024Updated last year
- [ICCV 2025] MM-IFEngine: Towards Multimodal Instruction Following☆117Feb 13, 2026Updated 2 weeks ago
- WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes☆107Mar 19, 2025Updated 11 months ago
- [ICLR 2026] An official implementation of "SIM-CoT: Supervised Implicit Chain-of-Thought"☆173Feb 4, 2026Updated 3 weeks ago
- Official implementation of "SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience"☆224Aug 7, 2025Updated 6 months ago
- [CVPR 2026] Official release of "Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning"☆112Updated this week
- open-sourced video dataset with dynamic scenes and camera movements annotation☆86Apr 24, 2025Updated 10 months ago
- ☆642May 24, 2024Updated last year
- [ICLR 2026] An official implementation of "STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence"☆39Jan 17, 2026Updated last month
- Code for "Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text" (NeurIPS 2024).☆371Mar 14, 2025Updated 11 months ago
- ☆77Oct 25, 2024Updated last year
- [ICLR'25] 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation☆367Jul 4, 2025Updated 7 months ago
- Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation☆282Nov 18, 2025Updated 3 months ago
- [ICLR 2025 Oral] Official code for "LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias"☆503Aug 4, 2025Updated 6 months ago
- [Arxiv 2025] In-Video Instructions: Visual Signals as Generative Control☆46Nov 25, 2025Updated 3 months ago
- ☆90May 30, 2025Updated 9 months ago
- Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation☆218Dec 9, 2025Updated 2 months ago
- ☆58Oct 19, 2025Updated 4 months ago
- [SIGGRAPH 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control☆807Jun 9, 2025Updated 8 months ago
- [CVPR 2025] Official implementation of ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way☆47Oct 10, 2025Updated 4 months ago
- 📷 [CVPR'26] Camera-controlled text-to-video generation, now with intrinsics, distortion and orientation control!☆122Feb 21, 2026Updated last week
- [TPAMI 2025] ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis☆1,515Dec 13, 2025Updated 2 months ago
- [ICML 2025 Oral] An official implementation of VideoRoPE & VideoRoPE++☆219Feb 2, 2026Updated last month
- AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers☆155Sep 16, 2025Updated 5 months ago