Hongbin98 / DriveGEN
This is the official project repository for "DriveGEN: Generalized and Robust 3D Detection in Driving via Controllable Text-to-Image Diffusion Generation" (CVPR 2025)
☆37 Updated 5 months ago
Alternatives and similar repositories for DriveGEN
Users who are interested in DriveGEN are comparing it to the repositories listed below.
- Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving (AAAI-25) ☆95 Updated 11 months ago
- [ICLR 2025] Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Driving ☆52 Updated 11 months ago
- Street-View Image Generation from a Bird’s-Eye View Layout: Official Codebase ☆80 Updated last year
- Code for CVPR2025 paper: Generating Multimodal Driving Scenes via Next-Scene Prediction ☆100 Updated 2 months ago
- [ICLR 2025] Official code implementation for the paper "X-Drive: Cross-modality Consistent Multi-Sensor Data Synthesis for Driving Scenar… ☆63 Updated 10 months ago
- official code of *DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model* ☆61 Updated last year
- Doe-1: Closed-Loop Autonomous Driving with Large World Model ☆116 Updated last year
- [NeurIPS 2025] RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning ☆178 Updated 2 months ago
- [NeurIPS 2025] OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection ☆67 Updated last year
- Official Code Release of Delphi ☆56 Updated last year
- [CVPR2024] Official Repository of Paper "Panacea: Panoramic and Controllable Video Generation for Autonomous Driving" ☆253 Updated last year
- [ECCV2024] UniM2AE: Multi-modal Masked Autoencoders with Unified 3D Representation for 3D Perception in Autonomous Driving ☆66 Updated last year
- OPUS: Occupancy Prediction Using a Sparse Set ☆142 Updated 3 weeks ago
- ☆127 Updated last year
- [ICRA2025] A dual-branch conditional diffusion model designed to enhance driving scene generation across multiple views and video sequenc… ☆37 Updated 8 months ago
- [ECCV 2024] TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes ☆128 Updated 10 months ago
- DrivePI: Spatial-aware 4D MLLM for Unified Autonomous Driving Understanding, Perception, Prediction and Planning ☆76 Updated last month
- [NeurIPS 2025] SURDS: Benchmarking Spatial Understanding and Reasoning in Driving Scenarios with Vision Language Models ☆78 Updated 4 months ago
- ECCV 2024 Paper List about Autonomous Driving ☆129 Updated last year
- ☆137 Updated last month
- [CVPR 2024] MAPLM: A Large-Scale Vision-Language Dataset for Map and Traffic Scene Understanding ☆164 Updated 2 years ago
- OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving ☆196 Updated last year
- Code release for the ECCV 2024 paper 'Fully Test-Time Adaptation for Monocular 3D Object Detection' ☆57 Updated last year
- ☆54 Updated last year
- ☆70 Updated last year
- [NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model ☆83 Updated last year
- [ICCV 2025] Language Driven Occupancy Prediction ☆35 Updated last year
- UniPAD: A Universal Pre-training Paradigm for Autonomous Driving (CVPR 2024) ☆203 Updated last year
- ☆46 Updated 8 months ago
- (ICCV2025) End-to-End Driving with Online Trajectory Evaluation via BEV World Model ☆192 Updated 7 months ago