jxbbb / TOD3Cap
[ECCV 2024] TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes
☆118Updated 2 months ago
Alternatives and similar repositories for TOD3Cap:
Users that are interested in TOD3Cap are comparing it to the libraries listed below
- ☆90Updated 3 months ago
- Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model☆79Updated 4 months ago
- [NeurIPS 2024] A Unified Framework for 3D Scene Understanding☆137Updated 5 months ago
- [ECCV 2024] Monocular Occupancy Prediction for Scalable Indoor Scenes☆58Updated 7 months ago
- OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving☆173Updated 11 months ago
- Code for CVPR2025 paper: Generating Multimodal Driving Scenes via Next-Scene Prediction☆60Updated last month
- Doe-1: Closed-Loop Autonomous Driving with Large World Model☆89Updated 3 months ago
- [ICRA'2024] MonoOcc: Digging into Monocular Semantic Occupancy Prediction☆100Updated last year
- CoRL2024 | Hint-AD: Holistically Aligned Interpretability for End-to-End Autonomous Driving☆56Updated 6 months ago
- ☆46Updated 3 months ago
- Project Page for GaussianFormer☆25Updated 11 months ago
- [ECCV 2024] WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation☆103Updated 3 months ago
- [NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model☆67Updated 5 months ago
- [AAAI 2024] Mono3DVG: 3D Visual Grounding in Monocular Images, AAAI, 2024☆58Updated last year
- [CVPR 2025] GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding☆136Updated last month
- Official Code Release of Delphi☆55Updated 11 months ago
- [CVPR 2025] ReconDreamer☆137Updated 4 months ago
- [CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding☆115Updated 2 weeks ago
- Source code for NeurIPS paper "POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images"☆106Updated 4 months ago
- ☆35Updated this week
- Official Github Repo for GEM☆51Updated last week
- Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving (AAAI-25)☆40Updated 2 months ago
- ☆78Updated last year
- [ECCV 2024] Occupancy as Set of Points☆89Updated 10 months ago
- ☆108Updated 10 months ago
- ☆84Updated 4 months ago
- ☆23Updated 3 months ago
- official code of *DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model*☆44Updated 3 months ago
- [ICLR 2025] Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Driving☆30Updated 2 months ago
- [CVPR2024] Official Repository of Paper "Panacea: Panoramic and Controllable Video Generation for Autonomous Driving"☆225Updated 8 months ago