jxbbb / TOD3Cap
[ECCV 2024] TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes
☆128 Updated 10 months ago
Alternatives and similar repositories for TOD3Cap
Users who are interested in TOD3Cap are comparing it to the repositories listed below.
- ☆132 Updated last month
- OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving ☆194 Updated last year
- official code of *DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model* ☆57 Updated 11 months ago
- [NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model ☆82 Updated last year
- [ICCV 2025] Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding ☆67 Updated 11 months ago
- Doe-1: Closed-Loop Autonomous Driving with Large World Model ☆112 Updated 11 months ago
- [ICCV 2025] Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model ☆96 Updated last year
- CoRL2024 | Hint-AD: Holistically Aligned Interpretability for End-to-End Autonomous Driving ☆71 Updated last year
- ☆125 Updated last year
- [ICLR 2025] Official code implementation for the paper "X-Drive: Cross-modality Consistent Multi-Sensor Data Synthesis for Driving Scenar… ☆62 Updated 9 months ago
- Source code for NeurIPS paper "POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images" ☆114 Updated 11 months ago
- [CVPR 2025] GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding ☆194 Updated 3 months ago
- Code for "DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT" ☆229 Updated 11 months ago
- Official Code Release of Delphi ☆56 Updated last year
- Code for CVPR2025 paper: Generating Multimodal Driving Scenes via Next-Scene Prediction ☆99 Updated last month
- [ECCV 2024] Monocular Occupancy Prediction for Scalable Indoor Scenes ☆66 Updated last year
- Official Github Repo for GEM ☆99 Updated 2 months ago
- [ICCV 2025] Detect Anything 3D in the Wild ☆246 Updated 3 weeks ago
- [NeurIPS 2025] Genesis: Multimodal Driving Scene Generation with Spatio-Temporal and Cross-Modal Consistency ☆73 Updated 3 months ago
- [ECCV 2024] WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation ☆112 Updated 10 months ago
- UniPAD: A Universal Pre-training Paradigm for Autonomous Driving (CVPR 2024) ☆203 Updated last year
- [CVPR2024] Official Repository of Paper "Panacea: Panoramic and Controllable Video Generation for Autonomous Driving" ☆252 Updated last year
- ☆51 Updated 7 months ago
- DrivePI: Spatial-aware 4D MLLM for Unified Autonomous Driving Understanding, Perception, Prediction and Planning ☆61 Updated 2 weeks ago
- [AAAI 2024] Mono3DVG: 3D Visual Grounding in Monocular Images, AAAI, 2024 ☆65 Updated last year
- [AAAI 2025] DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation ☆224 Updated 9 months ago
- Code release for our NeurIPS 2023 paper "Uni3DETR: Unified 3D Detection Transformer", our ECCV 2024 paper "OV-Uni3DETR: Towards Unified O… ☆115 Updated last year
- [ICLR 2025] Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Driving ☆50 Updated 10 months ago
- [3DV 2026] Open Vocabulary Monocular 3D Object Detection ☆70 Updated last month
- [CVPR 2025] ReconDreamer ☆197 Updated last year