hou-yz / dvgformer
Code for our paper: Learning Camera Movement Control from Real-World Drone Videos
☆27Updated last month
Alternatives and similar repositories for dvgformer:
Users that are interested in dvgformer are comparing it to the libraries listed below
- ☆32Updated last week
- [CVPR 2025] Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video☆29Updated this week
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)☆66Updated last month
- [ECCV 2024] M3DBench introduces a comprehensive 3D instruction-following dataset with support for interleaved multi-modal prompts.☆60Updated 6 months ago
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆98Updated 4 months ago
- This is the project page of ShowRoom3D☆25Updated last year
- [ICLR 2024] Official implementation of the paper "Toss: High-quality text-guided novel view synthesis from a single image"☆22Updated 10 months ago
- Repo for "Human-Centric Foundation Models: Perception, Generation and Agentic Modeling" (https://arxiv.org/abs/2502.08556)☆36Updated last month
- ☆15Updated last year
- Unofficial Implementation of "Stable Video Diffusion Multi-View"☆78Updated 11 months ago
- [3DV 2025] Learning Naturally Aggregated Appearance for Efficient 3D Editing☆34Updated last month
- Sora Generates Videos with Stunning Geometrical Consistency☆49Updated last year
- [ICLR' 25] AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation☆60Updated 2 weeks ago
- From Geometry to Texture: A Hierarchical Framework for Efficient Text-to-3D Generation☆31Updated last year
- [arXiv'24] Holistic-Motion2D: Scalable Whole-body Human Motion Generation in 2D Space☆39Updated 4 months ago
- Web page for "🍅HumanTOMATO: Text-aligned Whole-body Motion Generation".☆14Updated 10 months ago
- [CVPR 2024] Official code for EgoGen: An Egocentric Synthetic Data Generator☆78Updated 3 weeks ago
- ☆37Updated last week
- Official Code for "Digital Life Project: Autonomous 3D Characters with Social Intelligence"☆33Updated 6 months ago
- ☆38Updated 5 months ago
- ☆55Updated last month
- "Comp4D: Compositional 4D Scene Generation", Dejia Xu*, Hanwen Liang*, Neel P. Bhatt, Hezhen Hu, Hanxue Liang, Konstantinos N. Platanioti…☆77Updated 7 months ago
- TORE: Token Reduction for Efficient Human Mesh Recovery with Transformer☆47Updated last year
- [ICLR 2025] Where Am I and What Will I See : An Auto-Regressive Model for Spatial Localization and View Prediction☆28Updated last month
- Official Code of SocioMind for CVPR 2024 paper "Digital Life Project: Autonomous 3D Characters with Social Intelligence"☆24Updated 6 months ago
- [ECCV 2024] Official Implementation of DragAPart: Learning a Part-Level Motion Prior for Articulated Objects.☆77Updated 8 months ago
- HOSNeRF: Dynamic Human-Object-Scene Neural Radiance Fields from a Single Video☆68Updated last year
- [ECCV 2024] HiFi-123: Towards High-fidelity One Image to 3D Content Generation☆67Updated 8 months ago
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics☆88Updated 2 weeks ago
- ☆21Updated this week