Pendulumclock / FlightGPT
☆20 Updated 2 weeks ago
Alternatives and similar repositories for FlightGPT
Users who are interested in FlightGPT are comparing it to the repositories listed below
- ☆109 Updated 2 months ago
- [AAAI 2024] Official implementation of NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models ☆259 Updated last year
- [AAAI-25 Oral] Official Implementation of "FLAME: Learning to Navigate with Multimodal LLM in Urban Environments" ☆59 Updated 5 months ago
- ☆22 Updated 5 months ago
- Repository for Vision-and-Language Navigation via Causal Learning (Accepted by CVPR 2024) ☆81 Updated 2 months ago
- [ECCV 2024] Official implementation of NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models ☆197 Updated 10 months ago
- Embodied Question Answering (EQA) benchmark and method (ICCV 2025) ☆29 Updated last month
- ☆158 Updated last month
- LLaVA-VLA: A Simple Yet Powerful Vision-Language-Action Model [Actively Maintained🔥] ☆122 Updated last week
- ☆77 Updated 2 months ago
- Official implementation of Think Global, Act Local: Dual-scale GraphTransformer for Vision-and-Language Navigation (CVPR'22 Oral). ☆199 Updated 2 years ago
- Code of the paper "NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning" (TPAMI 2025) ☆92 Updated 2 months ago
- ☆13 Updated last month
- [ECCV 2024] Official implementation of C-Instructor: Controllable Navigation Instruction Generation with Chain of Thought Prompting ☆24 Updated 7 months ago
- ⭐️ Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning. ☆186 Updated 2 weeks ago
- [RSS 2024] NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation ☆227 Updated 3 weeks ago
- [CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation' ☆46 Updated last year
- [ACL 24] The official implementation of MapGPT: Map-Guided Prompting with Adaptive Path Planning for Vision-and-Language Navigation. ☆99 Updated 3 months ago
- [CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation' ☆199 Updated last year
- ☆158 Updated last month
- The new spin-off of Visual Language Navigation. ☆24 Updated last month
- Official code of paper "DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution" ☆100 Updated 5 months ago
- ☆10 Updated last year
- VELMA agent for VLN in Street View ☆24 Updated last year
- official implementation of NeurIPS 2023 paper "FGPrompt: Fine-grained Goal Prompting for Image-goal Navigation" ☆33 Updated last year
- The repo of paper `RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation` ☆129 Updated 7 months ago
- Codebase of ACL 2023 Findings "Aerial Vision-and-Dialog Navigation" ☆52 Updated 9 months ago
- The Official Implementation of RoboMatrix ☆95 Updated 2 months ago
- ☆19 Updated 9 months ago
- ☆159 Updated 4 months ago