rolsheng / MM-VUFM4DS
【IEEE T-IV】A systematic survey of multi-modal and multi-task visual understanding foundation models for driving scenarios
☆50Updated 11 months ago
Alternatives and similar repositories for MM-VUFM4DS:
Users that are interested in MM-VUFM4DS are comparing it to the libraries listed below
- ☆37Updated 2 months ago
- [ECCV 2024] Official implementation for "RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception"☆30Updated last month
- AeDet: Azimuth-invariant Multi-view 3D Object Detection, CVPR2023☆74Updated last year
- Benchmark and model for step-by-step reasoning in autonomous driving.☆48Updated last month
- MetaBEV: Solving Sensor Failures for BEV Detection and Map Segmentation☆92Updated 2 years ago
- Official repository for the NuScenes-MQA. This paper is accepted by LLVA-AD Workshop at WACV 2024.☆25Updated last year
- ☆66Updated 4 months ago
- [AAAI2025] Language Prompt for Autonomous Driving☆135Updated 4 months ago
- Curricular Object Manipulation in LiDAR-based Object Detection(CVPR 2023)☆37Updated last year
- [AAAI24]This is the implementation for the paper M-BEV: Masked BEV Perception for Robust Autonomous Driving☆38Updated last year
- ☆73Updated last month
- ☆59Updated 8 months ago
- [ACM MM2022, TIP2024] Graph-DETR Series for Multi-View 3D Object Detection☆41Updated last year
- OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection☆48Updated 5 months ago
- [ICCV 2023] GeoMIM: towards better 3d knowledge transfer via masked image modeling for multi-view 3d understanding☆48Updated last year
- The official implementation of the ECCV 2024 paper: Continuity Preserving Online CenterLine Graph Learning☆28Updated 4 months ago
- DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation (ICCV 2023)☆101Updated last year
- Official repository for paper "Can LVLMs Obtain a Driver’s License? A Benchmark Towards Reliable AGI for Autonomous Driving"☆29Updated 2 months ago
- EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy Network☆47Updated last month
- [ECCV 2024] Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression☆42Updated 7 months ago
- ☆53Updated 8 months ago
- ☆16Updated 9 months ago
- Codes for ICLR 2024: "MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection"☆70Updated 9 months ago
- [IROS 2023] DualCross: Cross-Modality Cross-Domain Adaptation for Monocular BEV Perception☆29Updated last year
- [IROS 2024]InverseMatrixVT3D: An Efficient Projection Matrix-Based Approach for 3D Occupancy Prediction☆27Updated 10 months ago
- ☆21Updated last year
- [ICLR'25] Official Implementation of STAMP: Scalable Task And Model-agnostic Collaborative Perception☆26Updated 3 months ago
- Doe-1: Closed-Loop Autonomous Driving with Large World Model☆89Updated 3 months ago
- Is Your HD Map Constructor Reliable under Sensor Corruptions?☆37Updated 8 months ago
- [CVPR2023] This is an official implementation of paper "DETRs with Hybrid Matching".☆28Updated 2 years ago