Shark0-0 / VG4D
Implementation of the paper: VG4D: Vision-Language Model Goes 4D Video Recognition (ICRA 2024)
☆15 · Updated last year
Alternatives and similar repositories for VG4D
Users who are interested in VG4D are comparing it to the repositories listed below
- This is the project page of ShowRoom3D ☆25 · Updated last year
- Official implementation of PARIS3D (Accepted to ECCV 2024). ☆25 · Updated 9 months ago
- [ICLR 2024] Official implementation of the paper "Toss: High-quality text-guided novel view synthesis from a single image" ☆22 · Updated last year
- [NeurIPS2023] Implementation of the paper: Explore In-Context Learning for 3D Point Cloud Understanding ☆69 · Updated 6 months ago
- DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation ☆26 · Updated last month
- ☆21 · Updated 6 months ago
- [CVPR 2025] Open-World Amodal Appearance Completion ☆20 · Updated 3 months ago
- [arXiv'24] Holistic-Motion2D: Scalable Whole-body Human Motion Generation in 2D Space ☆43 · Updated 7 months ago
- [ICLR 2025] Dataset and Code for Paper "Learning to Generate Diverse Pedestrian Movements from Web Videos with Noisy Labels" ☆40 · Updated 2 months ago
- [3DV 2025] Learning Naturally Aggregated Appearance for Efficient 3D Editing ☆34 · Updated 4 months ago
- Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization ☆21 · Updated 2 months ago
- ☆43 · Updated 2 months ago
- ☆23 · Updated last month
- Self-reimplemented version of 4D-LRM. ☆30 · Updated 3 weeks ago
- Official code repository for the paper: "TAPS3D: Text-Guided 3D Textured Shape Generation from Pseudo Supervision" ☆41 · Updated 2 years ago
- ☆23 · Updated 2 months ago
- ObjCtrl-2.5D ☆46 · Updated 2 months ago
- [ICCV 2023] Code for "Multi-task View Synthesis with Neural Radiance Fields" ☆11 · Updated last year
- VideoDirector [CVPR 2025] ☆22 · Updated 2 months ago
- [ICLR'25] AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation ☆64 · Updated 3 months ago
- [AAAI 2025] More Text, Less Point: Towards 3D Data-Efficient Point-Language Understanding ☆20 · Updated last month
- [CVPR 25] A framework named B^2-DiffuRL for RL-based diffusion model fine-tuning. ☆30 · Updated 2 months ago
- DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors ☆39 · Updated 9 months ago
- ☆33 · Updated last month
- Repo for "Human-Centric Foundation Models: Perception, Generation and Agentic Modeling" (https://arxiv.org/abs/2502.08556) ☆49 · Updated 4 months ago
- TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation ☆30 · Updated 7 months ago
- ☆25 · Updated last year
- [IJCV 2024] MoDA: Modeling Deformable 3D Objects from Casual Videos ☆33 · Updated 5 months ago
- Sora Generates Videos with Stunning Geometrical Consistency ☆50 · Updated last year
- (CVPR2025 Highlight) Official repository of paper "Panorama Generation From NFoV Image Done Right" ☆13 · Updated 3 weeks ago