Bosszhe / EMIFF
EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection
☆65Updated 4 months ago
Related projects: ⓘ
- MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning☆60Updated 6 months ago
- [CVPR 2024] Official PyTorch Code of SeaBird: Segmentation in Bird's View with Dice Loss Improves Monocular 3D Detection of Large Objects☆78Updated 2 months ago
- [ECAI 2023] MonoSKD: General Distillation Framework for Monocular 3D Object Detection via Spearman Correlation Coefficient☆30Updated 9 months ago
- WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation☆75Updated 9 months ago
- [CVPR 2024] Symphonies (Scene-from-Insts): Symphonize 3D Semantic Scene Completion with Contextual Instance Queries☆156Updated 2 months ago
- Official Toolkit for The RoboDrive Challenge☆71Updated 3 months ago
- A systematic survey of multi-modal and multi-task visual understanding foundation models for driving scenarios☆45Updated 3 months ago
- [CVPR 2024] MAPLM: A Large-Scale Vision-Language Dataset for Map and Traffic Scene Understanding☆83Updated 10 months ago
- BEVGen☆59Updated 7 months ago
- [ECCV 2024] The official code for "Dolphins: Multimodal Language Model for Driving“☆29Updated 2 months ago
- OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving☆137Updated 3 months ago
- ☆95Updated 6 months ago
- LightwheelOcc: A 3D Occupancy Synthetic Dataset in Autonomous Driving☆69Updated 4 months ago
- ☆46Updated last month
- Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving☆52Updated 3 months ago
- [ECCV 2024] Embodied Understanding of Driving Scenarios☆137Updated 2 weeks ago
- ☆114Updated 2 months ago
- [ECCV 2024] TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes☆93Updated last month
- ☆195Updated last month
- [NeurIPS 2023] Asynchrony-Robust Collaborative Perception via Bird’s Eye View Flow☆66Updated 11 months ago
- Official PyTorch implementation of CODA-LM(https://arxiv.org/abs/2404.10595)☆57Updated 2 months ago
- InverseMatrixVT3D: An Efficient Projection Matrix-Based Approach for 3D Occupancy Prediction☆22Updated 2 months ago
- [ECCV2024] UniM2AE: Multi-modal Masked Autoencoders with Unified 3D Representation for 3D Perception in Autonomous Driving☆38Updated 2 weeks ago
- Codebase for the WayveScenes101 Dataset☆154Updated last month
- [ICRA'2024] MonoOcc: Digging into Monocular Semantic Occupancy Prediction☆74Updated 10 months ago
- Talk2BEV: Language-Enhanced Bird's Eye View Maps (Accepted to ICRA'24)☆92Updated 7 months ago
- Enhancing End-to-End Autonomous Driving with Latent World Model☆73Updated 3 months ago
- Simulator-conditioned Driving Scene Generation☆45Updated 2 months ago
- ☆12Updated 3 months ago
- [CVPR2024] Official implementation of "RCooper: A Real-world Large-scale Dataset for Roadside Cooperative Perception"☆68Updated 3 months ago