rolsheng / MM-VUFM4DS
A systematic survey of multi-modal and multi-task visual understanding foundation models for driving scenarios
☆48 Updated 8 months ago
Alternatives and similar repositories for MM-VUFM4DS:
Users who are interested in MM-VUFM4DS compare it with the repositories listed below
- AeDet: Azimuth-invariant Multi-view 3D Object Detection, CVPR2023 ☆73 Updated last year
- ☆66 Updated 4 months ago
- MetaBEV: Solving Sensor Failures for BEV Detection and Map Segmentation ☆90 Updated last year
- Official repository for NuScenes-MQA. The paper was accepted at the LLVA-AD Workshop at WACV 2024. ☆25 Updated last year
- [ECCV 2024] Official implementation for "RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception" ☆26 Updated 2 months ago
- DAOcc: 3D Object Detection Assisted Multi-Sensor Fusion for 3D Occupancy Prediction ☆25 Updated last month
- ☆29 Updated 2 months ago
- [ACM MM2022, TIP2024] Graph-DETR Series for Multi-View 3D Object Detection ☆40 Updated last year
- Curricular Object Manipulation in LiDAR-based Object Detection (CVPR 2023) ☆37 Updated last year
- ☆47 Updated last month
- OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection ☆33 Updated 2 months ago
- [IROS 2024] InverseMatrixVT3D: An Efficient Projection Matrix-Based Approach for 3D Occupancy Prediction ☆27 Updated 6 months ago
- [ICCV 2023] GeoMIM: towards better 3d knowledge transfer via masked image modeling for multi-view 3d understanding ☆46 Updated last year
- EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy Network ☆41 Updated 7 months ago
- Enhancing 3D Lane Detection and Topology Reasoning with 2D Lane Priors ☆57 Updated 5 months ago
- [AAAI2025] Language Prompt for Autonomous Driving ☆129 Updated last month
- [AAAI24] This is the implementation for the paper M-BEV: Masked BEV Perception for Robust Autonomous Driving ☆36 Updated 10 months ago
- Official repository for paper "Can LVLMs Obtain a Driver’s License? A Benchmark Towards Reliable AGI for Autonomous Driving" ☆26 Updated last month
- Official PyTorch implementation of End-to-end 3D Tracking with Decoupled Queries [ICCV 2023] ☆58 Updated last year
- Codes for ICLR 2024: "MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection" ☆68 Updated 6 months ago
- ☆54 Updated 5 months ago
- [ICCV 2023] Revisiting Domain-Adaptive 3D Object Detection by Reliable, Diverse and Class-balanced Pseudo-Labeling ☆47 Updated last year
- ☆24 Updated 11 months ago
- [NeurIPS 2024] Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving ☆106 Updated last week
- DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation (ICCV 2023) ☆94 Updated last year
- (AAAI2024) Regulating Intermediate 3D Features for Vision-Centric Autonomous Driving ☆20 Updated last year
- Doe-1: Closed-Loop Autonomous Driving with Large World Model ☆76 Updated last week
- MambaOcc: Visual State Space Model for BEV-based Occupancy Prediction with Local Adaptive Reordering ☆19 Updated 5 months ago
- [CVPR 2023] BEVGuide: BEV-Guided Multi-Modality Fusion for Driving Perception ☆36 Updated last year
- VQ-Map [NeurIPS 2024] ☆21 Updated 3 weeks ago