Shark0-0 / VG4D
Implementation of the paper: VG4D: Vision-Language Model Goes 4D Video Recognition(ICRA 2024)
☆11Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for VG4D
- This is the project page of ShowRoom3D☆25Updated 11 months ago
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆59Updated last week
- [ECCV 2024] M3DBench introduces a comprehensive 3D instruction-following dataset with support for interleaved multi-modal prompts.☆57Updated last month
- [NeurIPS2023] Implementation of the paper: Explore In-Context Learning for 3D Point Cloud Understanding☆64Updated last week
- ☆22Updated 6 months ago
- "Comp4D: Compositional 4D Scene Generation", Dejia Xu*, Hanwen Liang*, Neel P. Bhatt, Hezhen Hu, Hanxue Liang, Konstantinos N. Platanioti…☆73Updated 2 months ago
- ☆32Updated 7 months ago
- Sora Generates Videos with Stunning Geometrical Consistency☆47Updated 7 months ago
- Semantic Score Distillation Sampling for Compositional Text-to-3D Generation☆28Updated last month
- Unofficial Implementation of "Stable Video Diffusion Multi-View"☆73Updated 7 months ago
- [ICLR 2024] Official implementation of the paper "Toss: High-quality text-guided novel view synthesis from a single image"☆17Updated 6 months ago
- ☆21Updated 3 months ago
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning☆26Updated 3 weeks ago
- 4D Panoptic Scene Graph Generation (NeurIPS'23 Spotlight)☆87Updated 6 months ago
- Generative World Explorer☆32Updated this week
- Official implementation of PARIS3D (Accepted to ECCV 2024).☆18Updated last month
- [ECCV 2024] EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion.☆62Updated 5 months ago
- Official PyTorch implementation of "A Unified Approach for Text- and Image-guided 4D Scene Generation", [CVPR 2024]☆62Updated 6 months ago
- Open-Vocabulary SAM3D: Understand Any 3D Scene☆25Updated 2 months ago
- Code for paper Background Prompting for Improved Object Depth☆29Updated last year
- Learning Naturally Aggregated Appearance for Efficient 3D Editing☆34Updated 10 months ago
- [ICCV 2023] Rendering Humans from Object-Occluded Monocular Videos☆42Updated 9 months ago
- Official code for "Amodal Completion via Progressive Mixed Context Diffusion" [CVPR 2024 Highlight]☆30Updated 3 months ago
- [3DV 2025] Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model☆45Updated 5 months ago
- ☆36Updated last month
- Official code repository for the paper: "TAPS3D: Text-Guided 3D Textured Shape Generation from Pseudo Supervision"☆40Updated last year
- ☆15Updated 7 months ago
- [ECCV 2024] HiFi-123: Towards High-fidelity One Image to 3D Content Generation☆59Updated 4 months ago
- ☆28Updated 5 months ago
- [ECCV 2024] Official code for: SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer☆80Updated 2 months ago