xmed-lab / NuInstruct
☆46Updated last month
Related projects: ⓘ
- Official PyTorch implementation of CODA-LM(https://arxiv.org/abs/2404.10595)☆57Updated 2 months ago
- Reason2Drive: Towards Interpretable and Chain-based Reasoning for Autonomous Driving☆70Updated 8 months ago
- ☆114Updated 2 months ago
- [CVPR 2024] MAPLM: A Large-Scale Vision-Language Dataset for Map and Traffic Scene Understanding☆83Updated 10 months ago
- [IROS 2023] DualCross: Cross-Modality Cross-Domain Adaptation for Monocular BEV Perception☆28Updated 9 months ago
- [ECCV 2024] The official code for "Dolphins: Multimodal Language Model for Driving“☆29Updated 2 months ago
- [ICCV 2023] GeoMIM: towards better 3d knowledge transfer via masked image modeling for multi-view 3d understanding☆43Updated last year
- [AAAI 2024] NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving Scenario.☆145Updated 9 months ago
- [CVPR 2024] LaMPilot: An Open Benchmark Dataset for Autonomous Driving with Language Model Programs☆23Updated 5 months ago
- DetMatch: Two Teachers are Better Than One for Joint 2D and 3D Semi-Supervised Object Detection☆35Updated last year
- [CVPR 2023] MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training☆47Updated last year
- ☆12Updated 3 months ago
- [ECCV 2024] Embodied Understanding of Driving Scenarios☆137Updated 2 weeks ago
- Official Code Release of Delphi☆44Updated 3 months ago
- ☆36Updated this week
- Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving☆52Updated 3 months ago
- Enhancing End-to-End Autonomous Driving with Latent World Model☆73Updated 3 months ago
- Official GitHub repository for the paper "LingoQA: Video Question Answering for Autonomous Driving"☆106Updated 5 months ago
- ☆12Updated last month
- A Multi-Modal Large Language Model with Retrieval-augmented In-context Learning capacity designed for generalisable and explainable end-t…☆63Updated 2 months ago
- This is the official implementation of "LSK3DNet: Towards Effective and Efficient 3D Perception with Large Sparse Kernels" (Accepted at C…☆22Updated 3 months ago
- Simulator-conditioned Driving Scene Generation☆45Updated 2 months ago
- Talk2BEV: Language-Enhanced Bird's Eye View Maps (Accepted to ICRA'24)☆92Updated 7 months ago
- A systematic survey of multi-modal and multi-task visual understanding foundation models for driving scenarios☆45Updated 3 months ago
- ☆51Updated 10 months ago
- Official repository for the NuScenes-MQA. This paper is accepted by LLVA-AD Workshop at WACV 2024.☆20Updated 8 months ago
- The Official Implementation of PillarNeSt☆37Updated last month
- Codes for ICLR 2024: "MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection"☆61Updated 2 months ago
- The multi-view version of MonoDETR on nuScenes dataset☆19Updated last year
- Awesome Papers about World Models in Autonomous Driving☆61Updated 4 months ago