pixeli99 / W-CODA2024-Track2Links
This repository is dedicated to Track 2 of the W-CODA 2024 Workshop, "Multimodal Perception and Comprehension of Corner Cases in Autonomous Driving," held at ECCV 2024.
☆15Updated last year
Alternatives and similar repositories for W-CODA2024-Track2
Users that are interested in W-CODA2024-Track2 are comparing it to the libraries listed below
Sorting:
- ☆29Updated last year
- Official Code Release of Delphi☆56Updated last year
- [ICCV 2025] Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model☆92Updated 11 months ago
- ☆102Updated last year
- [NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model☆79Updated 11 months ago
- Adding Scene-Centric Forecasting Control to Occupancy World Model☆33Updated 3 months ago
- [IROS 2023] DualCross: Cross-Modality Cross-Domain Adaptation for Monocular BEV Perception☆32Updated 2 years ago
- [CVPR 2023] MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training☆46Updated 2 years ago
- [ECCV 2024] TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes☆129Updated 8 months ago
- ☆69Updated last year
- [ICLR 2025] Official code implementation for the paper "X-Drive: Cross-modality Consistent Multi-Sensor Data Synthesis for Driving Scenar…☆61Updated 8 months ago
- Visual Spatial Tuning☆146Updated 2 weeks ago
- Code for CVPR2025 paper: Generating Multimodal Driving Scenes via Next-Scene Prediction☆93Updated 3 weeks ago
- [ECCV 2024] WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation☆112Updated 9 months ago
- [WACV 2025 Oral] Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding☆60Updated 8 months ago
- [ICCV 2025] GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding☆69Updated 5 months ago
- [Arxiv'25] MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization☆51Updated 2 months ago
- [ECCV 2024] Official implementation of "RangeLDM: Fast Realistic LiDAR Point Cloud Generation"☆41Updated last year
- ☆18Updated 7 months ago
- ☆15Updated last year
- Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024☆30Updated last year
- Doe-1: Closed-Loop Autonomous Driving with Large World Model☆108Updated 10 months ago
- [ECCV 2024] Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression☆49Updated last year
- ☆26Updated 4 months ago
- [ICCV 2025] InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models☆80Updated 5 months ago
- Project Page for GaussianFormer☆24Updated last year
- Street-View Image Generation from a Bird’s-Eye View Layout: Official Codebase☆77Updated last year
- [ECCV'24] Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene.☆38Updated last year
- official code of *DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model*☆53Updated 10 months ago
- [CVPR2024] Official Repository of Paper "Panacea: Panoramic and Controllable Video Generation for Autonomous Driving"☆251Updated last year