Wenzhuo-Liu / MMTL-UniAD
MMTL-UniAD: A Unified Framework for Multimodal and Multi-Task Learning in Assistive Driving Perception
☆24 Updated 2 months ago
Alternatives and similar repositories for MMTL-UniAD
Users who are interested in MMTL-UniAD compare it to the repositories listed below.
- ☆16 Updated 2 years ago
- Efficient Visual Question Answering for Autonomous Vehicles with Reasoning-Enhanced Small Vision-Language Models ☆19 Updated 7 months ago
- Official implementation of "InScope: A New Real-world 3D Infrastructure-side Collaborative Perception Dataset for Open Traffic Scenarios" ☆25 Updated last week
- This repo contains the code for paper "LightEMMA: Lightweight End-to-End Multimodal Model for Autonomous Driving" ☆126 Updated last week
- [NeurIPS 2025] SURDS: Benchmarking Spatial Understanding and Reasoning in Driving Scenarios with Vision Language Models ☆73 Updated 2 months ago
- FlowDrive: Energy Flow Field for End-to-End Autonomous Driving ☆38 Updated 2 months ago
- [NeurIPS 2023] Asynchrony-Robust Collaborative Perception via Bird’s Eye View Flow ☆81 Updated 2 years ago
- ☆73 Updated 3 months ago
- [IROS'25] CoMamba: Real-time Cooperative Perception Unlocked with State Space Models ☆25 Updated last year
- Griffin: Aerial-Ground Cooperative Detection and Tracking Benchmark ☆73 Updated 3 months ago
- ☆27 Updated last year
- The official implementation of the ECCV 2024 paper: Continuity Preserving Online CenterLine Graph Learning ☆32 Updated 11 months ago
- Benchmark and model for step-by-step reasoning in autonomous driving. ☆66 Updated 8 months ago
- 【IEEE T-IV】A systematic survey of multi-modal and multi-task visual understanding foundation models for driving scenarios ☆50 Updated last year
- Track 1: Driving with Language ☆24 Updated 3 months ago
- Official repository for NuScenes-MQA. The paper was accepted by the LLVA-AD Workshop at WACV 2024. ☆34 Updated last year
- [AAAI 2025] Generative Planning with 3D-vision Language Pre-training for End-to-End Autonomous Driving ☆46 Updated 6 months ago
- Repo for 'VLM-Auto: VLM-based Autonomous Driving Assistant with Human-like Behavior and Understanding for Complex Road Scenes' ☆24 Updated last year
- This repository is a paper summary of the latest progress in cooperative/collaborative/multi-agent perception datasets in autonomous dri… ☆32 Updated 3 months ago
- Code release for the ECCV 2024 paper 'Fully Test-Time Adaptation for Monocular 3D Object Detection' ☆55 Updated 11 months ago
- Official PyTorch implementation of CODA-LM (https://arxiv.org/abs/2404.10595) ☆95 Updated 11 months ago
- [CVPR2024] Official implementation of "RCooper: A Real-world Large-scale Dataset for Roadside Cooperative Perception" ☆110 Updated last year
- ☆74 Updated last year
- [AAAI2025] Language Prompt for Autonomous Driving ☆150 Updated 2 months ago
- ☆95 Updated 11 months ago
- [Communications in Transportation Research] Official PyTorch Implementation of ''GPT-4 enhanced multimodal grounding for autonomous driv… ☆25 Updated last year
- Talk2BEV: Language-Enhanced Bird's Eye View Maps (ICRA'24) ☆113 Updated last year
- ☆65 Updated last year
- [IROS2023] Calibration-free BEV Representation for Infrastructure Perception ☆41 Updated 2 years ago
- CVPR 2024 Papers Autonomous Driving ☆38 Updated 2 years ago