GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography
☆109Dec 31, 2025Updated 2 months ago
Alternatives and similar repositories for GenDoP
Users that are interested in GenDoP are comparing it to the libraries listed below
Sorting:
- RelightVid: Temporal-Consistent Diffusion Model for Video Relighting☆109Apr 2, 2025Updated 11 months ago
- (ICLR 2026)Official repository of 'ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing’☆59Jan 26, 2026Updated last month
- Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation☆48Dec 11, 2024Updated last year
- [ICLR 2026] Official implementation of DiCache: Let Diffusion Model Determine Its Own Cache☆58Jan 26, 2026Updated last month
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning☆35Aug 28, 2025Updated 6 months ago
- [NeurIPS 2025] Official implementation of HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance☆86Sep 18, 2025Updated 6 months ago
- (Siggraph Asia 2023) Project Page of "HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image"☆10Dec 9, 2023Updated 2 years ago
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆37Jan 21, 2025Updated last year
- Official code of "Imagine360: Immersive 360 Video Generation from Perspective Anchor"☆170May 14, 2025Updated 10 months ago
- [CVPR 2026] An official implementation of "Think Visually, Reason Textually: Vision-Language Synergy in ARC"☆39Nov 26, 2025Updated 3 months ago
- Official implementation of "SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience"☆231Aug 7, 2025Updated 7 months ago
- [ NeurIPS 2024 D&B Track ] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"☆73Dec 27, 2024Updated last year
- ☆22Aug 5, 2024Updated last year
- This project explores the opportunities of deep learning for camera control in virtual cinematography.☆106Jan 22, 2024Updated 2 years ago
- [CVPR 2024] S-DyRF: Reference-Based Stylized Radiance Fields for Dynamic Scenes☆13Jun 1, 2024Updated last year
- Official repo for "IDArb: Intrinsic Decomposition for arbitrary number of input views and illuminations"☆97Jul 9, 2025Updated 8 months ago
- [ICCV 2025] MM-IFEngine: Towards Multimodal Instruction Following☆119Feb 13, 2026Updated last month
- Official Implementation of VideoRFSplat: Direct Scene-Level Text-to-3D Gaussian Splatting Generation with Flexible Pose and Multi-View Jo…☆23Jun 27, 2025Updated 8 months ago
- ☆77Oct 25, 2024Updated last year
- [ICLR 2026] An official implementation of "STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence"☆40Jan 17, 2026Updated 2 months ago
- [CVPR 2026] Official release of "Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning"☆121Feb 25, 2026Updated 3 weeks ago
- [ICLR 2026] An official implementation of "SIM-CoT: Supervised Implicit Chain-of-Thought"☆186Feb 4, 2026Updated last month
- WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes☆109Mar 19, 2025Updated last year
- ☆57Oct 19, 2025Updated 5 months ago
- ☆644May 24, 2024Updated last year
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment.☆86May 4, 2025Updated 10 months ago
- Code for "Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text" (NeurIPS 2024).☆374Mar 14, 2025Updated last year
- open-sourced video dataset with dynamic scenes and camera movements annotation☆87Apr 24, 2025Updated 10 months ago
- [CVPR 2026] Official pytorch implementation of "ReDirector: Creating Any-Length Video Retakes with Rotary Camera Encoding"☆22Dec 17, 2025Updated 3 months ago
- [CVPR 2025] Official implementation of ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way☆48Oct 10, 2025Updated 5 months ago
- ☆10Jun 5, 2023Updated 2 years ago
- [ICLR'25] 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation☆367Jul 4, 2025Updated 8 months ago
- [ICCV-2025] Official implementation of Bootstrap3D: Improving Multi-view Diffusion Model with Synthetic Data☆96Jul 26, 2025Updated 7 months ago
- ☆91May 30, 2025Updated 9 months ago
- [SIGGRAPH 2025] LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation"☆314Jul 24, 2025Updated 7 months ago
- ☆12Sep 27, 2023Updated 2 years ago
- [ICML 2025 Oral] An official implementation of VideoRoPE & VideoRoPE++☆219Feb 2, 2026Updated last month
- official repo of paper for "CamI2V: Camera-Controlled Image-to-Video Diffusion Model"☆166Sep 29, 2025Updated 5 months ago
- List of papers on 4D Generation.☆324Oct 10, 2024Updated last year