xmed-lab / NuInstruct
☆59Updated 9 months ago
Alternatives and similar repositories for NuInstruct
Users that are interested in NuInstruct are comparing it to the libraries listed below
Sorting:
- Reason2Drive: Towards Interpretable and Chain-based Reasoning for Autonomous Driving☆83Updated last year
- Official PyTorch implementation of CODA-LM(https://arxiv.org/abs/2404.10595)☆88Updated 5 months ago
- Benchmark and model for step-by-step reasoning in autonomous driving.☆50Updated 2 months ago
- [AAAI2025] Language Prompt for Autonomous Driving☆135Updated 5 months ago
- ☆37Updated 2 months ago
- Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives☆75Updated 2 months ago
- [CVPR 2024] LaMPilot: An Open Benchmark Dataset for Autonomous Driving with Language Model Programs☆30Updated last year
- Doe-1: Closed-Loop Autonomous Driving with Large World Model☆90Updated 3 months ago
- [NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model☆67Updated 5 months ago
- Official repository for paper "Can LVLMs Obtain a Driver’s License? A Benchmark Towards Reliable AGI for Autonomous Driving"☆29Updated 2 months ago
- Official repository for the NuScenes-MQA. This paper is accepted by LLVA-AD Workshop at WACV 2024.☆25Updated last year
- End-to-End Driving with Online Trajectory Evaluation via BEV World Model☆71Updated last month
- [CVPR 2024] MAPLM: A Large-Scale Vision-Language Dataset for Map and Traffic Scene Understanding☆136Updated last year
- A Multi-Modal Large Language Model with Retrieval-augmented In-context Learning capacity designed for generalisable and explainable end-t…☆95Updated 7 months ago
- [ECCV 2024] The official code for "Dolphins: Multimodal Language Model for Driving“☆73Updated 3 months ago
- Talk2BEV: Language-Enhanced Bird's Eye View Maps (ICRA'24)☆109Updated 6 months ago
- [ECCV 2024] Asynchronous Large Language Model Enhanced Planner for Autonomous Driving☆81Updated 3 weeks ago
- Simulator designed to generate diverse driving scenarios.☆40Updated 2 months ago
- [IROS 2023] DualCross: Cross-Modality Cross-Domain Adaptation for Monocular BEV Perception☆29Updated last year
- [ECCV 2024] Embodied Understanding of Driving Scenarios☆191Updated 4 months ago
- ☆69Updated 4 months ago
- [ICCV 2023] GeoMIM: towards better 3d knowledge transfer via masked image modeling for multi-view 3d understanding☆48Updated last year
- [AAAI 2024] NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving Scenario.☆186Updated 6 months ago
- [ECCV 2024] Official GitHub repository for the paper "LingoQA: Visual Question Answering for Autonomous Driving"☆164Updated 7 months ago
- Fine-Grained Evaluation of Large Vision-Language Models in Autonomous Driving☆18Updated last week
- [CVPR 2023] MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training☆47Updated last year
- Street-View Image Generation from a Bird’s-Eye View Layout: Official Codebase☆75Updated last year
- [ECCV 2024] TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes☆123Updated 2 months ago
- project page of "RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning"☆15Updated 2 months ago
- 📚 A collection of resources and papers on Large Language Models in autonomous driving☆27Updated last year