rolsheng / MM-VUFM4DSLinks
【IEEE T-IV】A systematic survey of multi-modal and multi-task visual understanding foundation models for driving scenarios
☆50Updated last year
Alternatives and similar repositories for MM-VUFM4DS
Users that are interested in MM-VUFM4DS are comparing it to the libraries listed below
Sorting:
- AeDet: Azimuth-invariant Multi-view 3D Object Detection, CVPR2023☆77Updated 2 years ago
- Curricular Object Manipulation in LiDAR-based Object Detection(CVPR 2023)☆40Updated 2 years ago
- ☆73Updated 3 months ago
- MetaBEV: Solving Sensor Failures for BEV Detection and Map Segmentation☆94Updated 2 years ago
- [ECCV 2024] Official implementation for "RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception"☆33Updated 8 months ago
- [NeurIPS 2025] SURDS: Benchmarking Spatial Understanding and Reasoning in Driving Scenarios with Vision Language Models☆74Updated 2 months ago
- [ICCV 2023] GeoMIM: towards better 3d knowledge transfer via masked image modeling for multi-view 3d understanding☆49Updated 2 years ago
- [ECCV 2024] SimPB: A Single Model for 2D and 3D Object Detection from Multiple Cameras☆32Updated last year
- Official repository for the NuScenes-MQA. This paper is accepted by LLVA-AD Workshop at WACV 2024.☆35Updated last year
- [AAAI24]This is the implementation for the paper M-BEV: Masked BEV Perception for Robust Autonomous Driving☆40Updated last year
- [ACM MM2022, TIP2024] Graph-DETR Series for Multi-View 3D Object Detection☆42Updated 2 years ago
- Official PyTorch implementation of End-to-end 3D Tracking with Decoupled Queries [ICCV 2023]☆70Updated last year
- ☆17Updated last year
- The official implementation of the ECCV 2024 paper: Continuity Preserving Online CenterLine Graph Learning☆33Updated 11 months ago
- ☆70Updated last year
- [AAAI2025] Language Prompt for Autonomous Driving☆151Updated 2 months ago
- Is Your HD Map Constructor Reliable under Sensor Corruptions?☆37Updated last year
- [ICLR 2024] MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection☆75Updated last year
- [CVPR 2023] BEVGuide: BEV-Guided Multi-Modality Fusion for Driving Perception☆35Updated 2 years ago
- ☆60Updated last year
- [ECCV 2024] Towards Stable 3D Object Detection☆47Updated last year
- Benchmark and model for step-by-step reasoning in autonomous driving.☆67Updated 8 months ago
- [ECCV2024] UniM2AE: Multi-modal Masked Autoencoders with Unified 3D Representation for 3D Perception in Autonomous Driving☆64Updated last year
- [IROS 2024]InverseMatrixVT3D: An Efficient Projection Matrix-Based Approach for 3D Occupancy Prediction☆30Updated last year
- the official code of DriveMonkey☆40Updated 6 months ago
- [ECCV 2024] Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression☆49Updated last year
- [NeurIPS 2025] OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection☆60Updated last year
- [IROS 2023] DualCross: Cross-Modality Cross-Domain Adaptation for Monocular BEV Perception☆32Updated 2 years ago
- This is the implementation of the paper "SA-BEV: Generating Semantic-Aware Bird's-Eye-View Feature for Multi-view 3D Object Detection" (I…☆81Updated 2 years ago
- Deep Height Decoupling for Precise Vision-based 3D Occupancy Prediction (ICRA 2025)☆48Updated this week