rolsheng / MM-VUFM4DSLinks
【IEEE T-IV】A systematic survey of multi-modal and multi-task visual understanding foundation models for driving scenarios
☆51Updated last year
Alternatives and similar repositories for MM-VUFM4DS
Users that are interested in MM-VUFM4DS are comparing it to the libraries listed below
Sorting:
- [ECCV 2024] Official implementation for "RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception"☆31Updated 3 months ago
- ☆41Updated 3 weeks ago
- AeDet: Azimuth-invariant Multi-view 3D Object Detection, CVPR2023☆74Updated 2 years ago
- [ACM MM2022, TIP2024] Graph-DETR Series for Multi-View 3D Object Detection☆41Updated last year
- Benchmark and model for step-by-step reasoning in autonomous driving.☆56Updated 3 months ago
- the official code of DriveMonkey☆23Updated last month
- MetaBEV: Solving Sensor Failures for BEV Detection and Map Segmentation☆92Updated 2 years ago
- ☆73Updated 6 months ago
- ☆76Updated 3 months ago
- [ICCV 2023] GeoMIM: towards better 3d knowledge transfer via masked image modeling for multi-view 3d understanding☆47Updated last year
- Official repository for paper "Can LVLMs Obtain a Driver’s License? A Benchmark Towards Reliable AGI for Autonomous Driving"☆29Updated last month
- Fine-Grained Evaluation of Large Vision-Language Models in Autonomous Driving☆21Updated 3 weeks ago
- Doe-1: Closed-Loop Autonomous Driving with Large World Model☆95Updated 5 months ago
- Curricular Object Manipulation in LiDAR-based Object Detection(CVPR 2023)☆38Updated last year
- OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection☆52Updated 6 months ago
- [CVPR2023] This is an official implementation of paper "DETRs with Hybrid Matching".☆28Updated 2 years ago
- Is Your HD Map Constructor Reliable under Sensor Corruptions?☆37Updated 10 months ago
- Official repository for the NuScenes-MQA. This paper is accepted by LLVA-AD Workshop at WACV 2024.☆25Updated last year
- Enhancing 3D Lane Detection and Topology Reasoning with 2D Lane Priors☆64Updated 10 months ago
- Codes for ICLR 2024: "MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection"☆73Updated 11 months ago
- [AAAI2025] Language Prompt for Autonomous Driving☆139Updated 6 months ago
- [ECCV 2024] SimPB: A Single Model for 2D and 3D Object Detection from Multiple Cameras☆25Updated 9 months ago
- [AAAI24]This is the implementation for the paper M-BEV: Masked BEV Perception for Robust Autonomous Driving☆38Updated last year
- ☆53Updated 10 months ago
- ☆62Updated 10 months ago
- DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation (ICCV 2023)☆103Updated last year
- [IROS 2024]InverseMatrixVT3D: An Efficient Projection Matrix-Based Approach for 3D Occupancy Prediction☆28Updated 11 months ago
- MambaOcc: Visual State Space Model for BEV-based Occupancy Prediction with Local Adaptive Reordering☆19Updated 10 months ago
- EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy Network☆52Updated 2 months ago
- ☆46Updated 6 months ago