Shark0-0 / VG4D
Implementation of the paper: VG4D: Vision-Language Model Goes 4D Video Recognition(ICRA 2024)
☆11Updated 8 months ago
Alternatives and similar repositories for VG4D:
Users that are interested in VG4D are comparing it to the libraries listed below
- This is the project page of ShowRoom3D☆25Updated last year
- [ICLR 2024] Official implementation of the paper "Toss: High-quality text-guided novel view synthesis from a single image"☆20Updated 8 months ago
- Semantic Score Distillation Sampling for Compositional Text-to-3D Generation☆37Updated 3 months ago
- [ECCV 2024] M3DBench introduces a comprehensive 3D instruction-following dataset with support for interleaved multi-modal prompts.☆58Updated 3 months ago
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆90Updated 2 months ago
- Open-world 3D part segmentation of point clouds☆57Updated last month
- "Comp4D: Compositional 4D Scene Generation", Dejia Xu*, Hanwen Liang*, Neel P. Bhatt, Hezhen Hu, Hanxue Liang, Konstantinos N. Platanioti…☆76Updated 4 months ago
- Official PyTorch implementation - Video Motion Transfer with Diffusion Transformers☆34Updated last month
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning☆31Updated last month
- [NeurIPS2023] Implementation of the paper: Explore In-Context Learning for 3D Point Cloud Understanding☆66Updated last month
- Official implementation of PARIS3D (Accepted to ECCV 2024).☆21Updated 3 months ago
- GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data☆55Updated last month
- Official code for "Amodal Completion via Progressive Mixed Context Diffusion" [CVPR 2024 Highlight]☆36Updated 5 months ago
- Official code repository for the paper: "TAPS3D: Text-Guided 3D Textured Shape Generation from Pseudo Supervision"☆40Updated last year
- [3DV 2025] Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model☆51Updated 7 months ago
- The official implementation of "CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities". (arXiv 2501.08983)☆36Updated this week
- Open-Vocabulary SAM3D: Understand Any 3D Scene☆27Updated 4 months ago
- Unofficial Implementation of "Stable Video Diffusion Multi-View"☆78Updated 9 months ago
- ☆21Updated last month
- Sora Generates Videos with Stunning Geometrical Consistency☆47Updated 9 months ago
- [ECCV 2024] EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion.☆75Updated 7 months ago
- Code from the ECCV 2024 paper "Animal Avatar Reconstructing Animatable 3D Animals from Casual Videos".☆52Updated 2 months ago
- Learning Naturally Aggregated Appearance for Efficient 3D Editing☆34Updated last year
- Official repo of "Motion-Agent: A Conversational Framework for Human Motion Generation with LLMs"☆27Updated 3 months ago
- [ECCV 2024] Official Implementation of DragAPart: Learning a Part-Level Motion Prior for Articulated Objects.☆72Updated 5 months ago
- FleVRS: Towards Flexible Visual Relationship Segmentation, NeurIPS 2024☆19Updated last month
- ☆52Updated 3 months ago
- ☆38Updated 5 months ago
- The repository contains the official implementation of "DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery"☆24Updated 5 months ago
- Official code for 4Diffusion: Multi-view Video Diffusion Model for 4D Generation.☆87Updated 7 months ago