AIS-Clemson / VisionGPT
LLM-Assisted Real-Time Anomaly Detection for Safe Visual Navigation
☆37 · Updated last year
Alternatives and similar repositories for VisionGPT
Users who are interested in VisionGPT are comparing it to the repositories listed below:
- The official repository for paper: Agentic Robot: A Brain-Inspired Framework for Vision-Language-Action Models in Embodied Agents ☆25 · Updated 5 months ago
- ☆46 · Updated 5 months ago
- ☆54 · Updated last year
- [CVPRW 2024] TrafficVLM: A Controllable Visual Language Model for Traffic Video Captioning. Official code for the 3rd place solution of t… ☆48 · Updated 9 months ago
- A Multi-Modal Large Language Model with Retrieval-augmented In-context Learning capacity designed for generalisable and explainable end-t… ☆113 · Updated last year
- ☆27 · Updated last year
- [WACV'24 Workshop] Human-Centric Autonomous Systems With LLMs for User Command Reasoning ☆17 · Updated last year
- Image Instance Segmentation - Zero Shot - OpenAI's CLIP + Meta's SAM ☆72 · Updated 2 years ago
- ☆19 · Updated last year
- MMTL-UniAD: A Unified Framework for Multimodal and Multi-Task Learning in Assistive Driving Perception ☆24 · Updated 2 months ago
- ☆77 · Updated last month
- Repo for 'VLM-Auto: VLM-based Autonomous Driving Assistant with Human-like Behavior and Understanding for Complex Road Scenes' ☆24 · Updated last year
- ☆84 · Updated 6 months ago
- ☆286 · Updated 8 months ago
- ☆10 · Updated last year
- [Official] [IROS 2024] A goal-oriented planning to lift VLN performance for Closed-Loop Navigation: Simple, Yet Effective ☆29 · Updated last year
- Human Scene Transformer: A framework for trajectory prediction and wrappers for reframing the JRDB dataset for the prediction task. ☆72 · Updated last year
- Official codebase for "TAU-106K: A New Dataset for Comprehensive Understanding of Traffic Accident" ☆18 · Updated 7 months ago
- Official Code for DOROTHIE: Spoken Dialogue for Handling Unexpected Situations in Interactive Autonomous Driving Agents (Findings of EMNL… ☆22 · Updated 2 years ago
- [ICML 2025] Official implementation of TraffiX-Qwen model introduced in TUMTraf VideoQA benchmark for roadside traffic video understandin… ☆25 · Updated 2 months ago
- [ACL 24] The official implementation of MapGPT: Map-Guided Prompting with Adaptive Path Planning for Vision-and-Language Navigation. ☆116 · Updated 6 months ago
- ☆58 · Updated last year
- Implementation for paper "Follow the Rules: Reasoning for Video Anomaly Detection with Large Language Model" ☆96 · Updated 11 months ago
- ☆28 · Updated 5 months ago
- Code repository for SMART-LLM: Smart Multi-Agent Robot Task Planning using Large Language Models ☆168 · Updated last year
- The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models." ☆317 · Updated 2 months ago
- Toolkit for JRDB dataset ☆43 · Updated last year
- Repository about single/multi-agent, robotics, llm/vlm/vla, scientific discovery, etc. ☆17 · Updated 4 months ago
- A Light-Weight Framework for Open-Set Object Detection with Decoupled Feature Alignment in Joint Space ☆91 · Updated 10 months ago
- An offline embodied-AI guide dog based on the InternLM2 large language model ☆106 · Updated last year