Shark0-0 / VG4D
Implementation of the paper "VG4D: Vision-Language Model Goes 4D Video Recognition" (ICRA 2024)
☆15 · Updated last year
Alternatives and similar repositories for VG4D:
Users who are interested in VG4D are comparing it to the repositories listed below.
- This is the project page of ShowRoom3D ☆25 · Updated last year
- [ICLR 2025] Dataset and Code for Paper "Learning to Generate Diverse Pedestrian Movements from Web Videos with Noisy Labels" ☆23 · Updated last week
- [ICLR' 25] AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation ☆60 · Updated last month
- ☆22 · Updated 3 weeks ago
- Official implementation of "Reangle-A-Video: 4D Video Generation as Video-to-Video Translation" ☆38 · Updated last month
- Official implementation of PARIS3D (Accepted to ECCV 2024). ☆23 · Updated 7 months ago
- [ICLR 2024] Official implementation of the paper "Toss: High-quality text-guided novel view synthesis from a single image" ☆22 · Updated 11 months ago
- [IJCV 2024] MoDA: Modeling Deformable 3D Objects from Casual Videos ☆32 · Updated 3 months ago
- Toolbox for GTA-Human Datasets ☆16 · Updated 6 months ago
- GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data ☆64 · Updated 4 months ago
- DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors ☆37 · Updated 7 months ago
- [CVPR 2024 Highlight] Code release for "The More You See in 2D, the More You Perceive in 3D" ☆61 · Updated 6 months ago
- Repo for "Human-Centric Foundation Models: Perception, Generation and Agentic Modeling" (https://arxiv.org/abs/2502.08556) ☆39 · Updated 2 months ago
- PyTorch implementation of GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting ☆78 · Updated 3 weeks ago
- "Comp4D: Compositional 4D Scene Generation", Dejia Xu*, Hanwen Liang*, Neel P. Bhatt, Hezhen Hu, Hanxue Liang, Konstantinos N. Platanioti… ☆78 · Updated 8 months ago
- Official code repository for the paper: "TAPS3D: Text-Guided 3D Textured Shape Generation from Pseudo Supervision" ☆40 · Updated last year
- ☆21 · Updated 4 months ago
- ☆40 · Updated 8 months ago
- [ECCV 2024] Official Implementation of DragAPart: Learning a Part-Level Motion Prior for Articulated Objects. ☆80 · Updated 9 months ago
- ObjCtrl-2.5D ☆43 · Updated 3 weeks ago
- [NeurIPS2023] Implementation of the paper: Explore In-Context Learning for 3D Point Cloud Understanding ☆69 · Updated 4 months ago
- Official code for 4Diffusion: Multi-view Video Diffusion Model for 4D Generation. ☆100 · Updated 10 months ago
- [ICCV2023] "Enhancing NeRF akin to Enhancing LLMs: Generalizable NeRF Transformer with Mixture-of-View-Experts" by Wenyan Cong, Hanxue Li… ☆48 · Updated last year
- ☆25 · Updated last year
- WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes ☆84 · Updated last month
- ☆55 · Updated 7 months ago
- ☆16 · Updated last year
- [CVPR 2025] WildAvatar: Learning In-the-wild 3D Avatars from the Web ☆103 · Updated last month
- [ArXiv 2025] DNF-Avatar: Distilling Neural Fields for Real-time Animatable Avatar Relighting ☆24 · Updated this week
- [CVPR 2025] GPS as a Control Signal for Image Generation ☆16 · Updated last month