☆33 Dec 18, 2023 Updated 2 years ago
Alternatives and similar repositories for DriveVLM
Users that are interested in DriveVLM are comparing it to the libraries listed below
- ☆94 Oct 2, 2024 Updated last year
- ☆15 Sep 11, 2023 Updated 2 years ago
- ☆73 Aug 17, 2025 Updated 6 months ago
- ☆21 Feb 29, 2024 Updated 2 years ago
- [Official] [IROS 2024] A goal-oriented planning to lift VLN performance for Closed-Loop Navigation: Simple, Yet Effective ☆28 Apr 4, 2024 Updated last year
- Official implementation of FRAPPE: Infusing World Modeling into Generalist Policies via Multiple Future Representation Alignment ☆30 Feb 24, 2026 Updated last week
- Cloning the MT9721 driver into the MT7902 driver in the hopes of getting something running. ☆18 Mar 2, 2026 Updated last week
- Building a multi-agent RAG system with advanced RAG methods ☆12 Jan 12, 2025 Updated last year
- ☆13 Nov 30, 2024 Updated last year
- This is the code repository for a project at Ulm University. It's a fall detection system based on address-event-based cameras. ☆11 Sep 29, 2017 Updated 8 years ago
- use python to control tello drone ☆11 May 31, 2021 Updated 4 years ago
- code for paper "Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models" ☆46 Sep 21, 2023 Updated 2 years ago
- [CVPR 2024] LMDrive: Closed-Loop End-to-End Driving with Large Language Models ☆866 Apr 14, 2025 Updated 10 months ago
- This is the official project repository for "TopoStreamer: Temporal Lane Segment Topology Reasoning in Autonomous Driving" ☆17 Jul 31, 2025 Updated 7 months ago
- An official code of Densely-packed Object Detection via Hard Negative-Aware Anchor Attention in WACV2022 ☆12 Jan 6, 2022 Updated 4 years ago
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection ☆25 May 31, 2025 Updated 9 months ago
- ☆12 Feb 2, 2024 Updated 2 years ago
- ☆28 Jan 5, 2026 Updated 2 months ago
- Fine Tuning Stable Diffusion on Chinese Landscape Painting Generation (Chinese landscape painting generation based on diffusion models) ☆10 Apr 10, 2023 Updated 2 years ago
- Image search based on convolutional neural network feature extraction. ☆14 May 11, 2018 Updated 7 years ago
- Finetune the controlnet+stable diffusion model using diffuser ☆11 Sep 18, 2023 Updated 2 years ago
- ComfyUI version of Gemma3 ☆10 Sep 6, 2025 Updated 6 months ago
- mouse pet-ct image segmentation ☆12 Feb 19, 2023 Updated 3 years ago
- Unofficial implementation of 'Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator' ☆10 Dec 10, 2024 Updated last year
- An Agent built on the InternLm chat 7B large-model base that can call MMYOLO tools to complete visual tasks on images ☆11 Oct 30, 2024 Updated last year
- Surrogate Modeling of the Aerodynamic Performance for Transonic Regime ☆13 Feb 12, 2024 Updated 2 years ago
- Text Detection by RetinaNet with PyTorch (Code will be released soon) ☆10 Dec 1, 2018 Updated 7 years ago
- Text-to-Drive: Diverse Driving Behaviors Synthesis via Large Language Models ☆11 Mar 17, 2024 Updated last year
- PyTorch implementation of Supercombo, an end-to-end model for Level 2 autonomous driving on a single device (OpenPilot) ☆12 Jun 27, 2022 Updated 3 years ago
- Generate a 3D BIM Model from 2D CAD Drawings ☆12 Nov 23, 2022 Updated 3 years ago
- PyTorch and NumPy implementations of NMS and Soft-NMS ☆12 Mar 22, 2021 Updated 4 years ago
- ☆11 Nov 14, 2021 Updated 4 years ago
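- An m3net plugin for comfyui. m3net is a good salient object detection model that works well for image matting; I have open-sourced a model trained on e-commerce data for everyone to try. ☆12 Aug 16, 2024 Updated last year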
- ☆17 Oct 25, 2023 Updated 2 years ago
- [T-ITS 2024] EchoTrack: Auditory Referring Multi-Object Tracking for Autonomous Driving ☆13 Jun 8, 2025 Updated 9 months ago
- ☆12 Aug 23, 2019 Updated 6 years ago
- Code and database for Jacquot et al. CVPR 2020. Can we decode subtle human activities? ☆12 Dec 22, 2020 Updated 5 years ago
- Collect VLM models that can be tried online. ☆14 Apr 15, 2024 Updated last year
- decision-making processes of human drivers ☆13 Mar 28, 2024 Updated last year