Shark0-0 / VG4D
Implementation of the paper: VG4D: Vision-Language Model Goes 4D Video Recognition(ICRA 2024)
☆15Updated last year
Alternatives and similar repositories for VG4D
Users that are interested in VG4D are comparing it to the libraries listed below
Sorting:
- This is the project page of ShowRoom3D☆25Updated last year
- ☆21Updated 5 months ago
- ☆27Updated last year
- [CVPR 2025 Highlight] Towards Autonomous Micromobility through Scalable Urban Simulation☆21Updated 2 weeks ago
- ☆23Updated last month
- Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization☆20Updated last month
- [ArXiv 2025] DNF-Avatar: Distilling Neural Fields for Real-time Animatable Avatar Relighting☆25Updated 3 weeks ago
- A list of works on video generation towards world model☆58Updated last week
- [ICLR 2025] Dataset and Code for Paper "Learning to Generate Diverse Pedestrian Movements from Web Videos with Noisy Labels"☆36Updated last month
- WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes☆86Updated 2 months ago
- ☆40Updated 9 months ago
- [CVPR 2024] DreamControl: Control-Based Text-to-3D Generation with 3D Self-Prior☆70Updated last year
- ☆11Updated 2 months ago
- ☆25Updated last year
- [ICCV2023] "Enhancing NeRF akin to Enhancing LLMs: Generalizable NeRF Transformer with Mixture-of-View-Experts" by Wenyan Cong, Hanxue Li…☆48Updated last year
- Official Code for 'AR-1-to-3: Single Image to Consistent 3D Object Generation via Next-View Prediction'☆30Updated last week
- Generative model for 3D objects.☆16Updated last year
- "Comp4D: Compositional 4D Scene Generation", Dejia Xu*, Hanwen Liang*, Neel P. Bhatt, Hezhen Hu, Hanxue Liang, Konstantinos N. Platanioti…☆78Updated 8 months ago
- Official implementation of "Reangle-A-Video: 4D Video Generation as Video-to-Video Translation"☆39Updated 2 months ago
- [AAAI 2025] More Text, Less Point: Towards 3D Data-Efficient Point-Language Understanding☆16Updated 5 months ago
- [3DV 2025] Learning Naturally Aggregated Appearance for Efficient 3D Editing☆34Updated 3 months ago
- (Siggraph Asia 2023) Project Page of "HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image"☆10Updated last year
- ☆55Updated 7 months ago
- [ICLR 2024] Official implementation of the paper "Toss: High-quality text-guided novel view synthesis from a single image"☆22Updated last year
- open-sourced video dataset with dynamic scenes and camera movements annotation☆54Updated 3 weeks ago
- A framework named B^2-DiffuRL for RL-based diffusion model fine-tuning.☆29Updated last month
- Official repo for StyleMe3D☆19Updated 3 weeks ago
- (ECCV 2024) Official implementation of Paper ''DreamView: Injecting View-specific Text Guidance into Text-to-3D Generation''☆39Updated 6 months ago
- Official code for 4Diffusion: Multi-view Video Diffusion Model for 4D Generation.☆102Updated 11 months ago
- [CVPR 2025] GPS as a Control Signal for Image Generation☆18Updated 2 months ago