rolsheng / MM-VUFM4DS
A systematic survey of multi-modal and multi-task visual understanding foundation models for driving scenarios
☆47Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for MM-VUFM4DS
- AeDet: Azimuth-invariant Multi-view 3D Object Detection, CVPR2023☆72Updated last year
- ☆53Updated 2 months ago
- Curricular Object Manipulation in LiDAR-based Object Detection(CVPR 2023)☆37Updated last year
- [ACM MM2022, TIP2024] Graph-DETR Series for Multi-View 3D Object Detection☆40Updated last year
- MetaBEV: Solving Sensor Failures for BEV Detection and Map Segmentation☆87Updated last year
- [ICCV 2023] GeoMIM: towards better 3d knowledge transfer via masked image modeling for multi-view 3d understanding☆45Updated last year
- ☆120Updated 4 months ago
- Official repository for the NuScenes-MQA. This paper is accepted by LLVA-AD Workshop at WACV 2024.☆24Updated 11 months ago
- EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy Network☆41Updated 5 months ago
- [ECCV 2024] Official implementation for "RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception"☆21Updated 4 months ago
- Is Your HD Map Constructor Reliable under Sensor Corruptions?☆37Updated 3 months ago
- [IROS 2024]InverseMatrixVT3D: An Efficient Projection Matrix-Based Approach for 3D Occupancy Prediction☆24Updated 4 months ago
- ☆40Updated 2 months ago
- Official PyTorch implementation of End-to-end 3D Tracking with Decoupled Queries [ICCV 2023]☆58Updated 10 months ago
- DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation (ICCV 2023)☆89Updated 11 months ago
- The multi-view version of MonoDETR on nuScenes dataset☆20Updated 2 years ago
- ☆18Updated 2 years ago
- Codes for ICLR 2024: "MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection"☆64Updated 4 months ago
- Implementation of SimMOD: A Simple Baseline for Multi-Camera 3D Object Detection☆48Updated last year
- Code release for the ECCV 2024 paper 'Fully Test-Time Adaptation for Monocular 3D Object Detection'☆29Updated last month
- [CVPR 2023] BEVGuide: BEV-Guided Multi-Modality Fusion for Driving Perception☆33Updated last year
- MambaOcc: Visual State Space Model for BEV-based Occupancy Prediction with Local Adaptive Reordering☆19Updated 3 months ago
- UniDrive: Towards Universal Driving Perception Across Camera Configurations☆53Updated last month
- [CVPR2023] This is an official implementation of paper "DETRs with Hybrid Matching".☆27Updated 2 years ago
- [AAAI 2024] BEV-MAE: Bird's Eye View Masked Autoencoders for Point Cloud Pre-training in Autonomous Driving Scenarios☆44Updated 3 months ago
- [ICCV 2023] Revisiting Domain-Adaptive 3D Object Detection by Reliable, Diverse and Class-balanced Pseudo-Labeling☆46Updated last year
- [AAAI24]This is the implementation for the paper M-BEV: Masked BEV Perception for Robust Autonomous Driving☆33Updated 7 months ago
- Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving☆64Updated 3 weeks ago
- Official repository for paper "Can LVLMs Obtain a Driver’s License? A Benchmark Towards Reliable AGI for Autonomous Driving"☆21Updated 2 months ago
- [ECCV 2024] Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression☆37Updated 2 months ago