xyz9911 / FLAME
FLAME: Learning to Navigate with Multimodal LLM in Urban Environments (arXiv:2408.11051)
☆12Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for FLAME
- Official implementation of Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation (CoRL'24).☆29Updated 2 months ago
- ☆11Updated last year
- Official implementation of Learning from Unlabeled 3D Environments for Vision-and-Language Navigation (ECCV'22).☆34Updated last year
- ☆45Updated last month
- Repository for Vision-and-Language Navigation via Causal Learning (Accepted by CVPR 2024)☆37Updated 3 weeks ago
- [NeurIPS 2024] Official code repository for MSR3D paper☆24Updated this week
- [ECCV 2024] Official implementation of C-Instructor: Controllable Navigation Instruction Generation with Chain of Thought Prompting☆19Updated 4 months ago
- Implementation of our ICCV 2023 paper DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation☆19Updated last year
- Official Implementation of Learning Navigational Visual Representations with Semantic Map Supervision (ICCV2023)☆24Updated last year
- [ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities☆53Updated last month
- Official implementation of Layout-aware Dreamer for Embodied Referring Expression Grounding (AAAI'23).☆16Updated last year
- Planning as In-Painting: A Diffusion-Based Embodied Task Planning Framework for Environments under Uncertainty☆18Updated 11 months ago
- ☆25Updated last year
- ☆26Updated last year
- Official implementation of Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation (CVPR'24 H…☆39Updated 4 months ago
- ☆14Updated 10 months ago
- Repository of our accepted NeurIPS-2022 paper "Towards Versatile Embodied Navigation"☆20Updated last year
- Code of the ICCV 2023 paper "March in Chat: Interactive Prompting for Remote Embodied Referring Expression"☆24Updated 6 months ago
- [CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding☆44Updated 3 months ago
- Code of 3DMIT: 3D MULTI-MODAL INSTRUCTION TUNING FOR SCENE UNDERSTANDING☆24Updated 3 months ago
- [ICCV'23] Learning Vision-and-Language Navigation from YouTube Videos☆41Updated last year
- ☆9Updated 4 months ago
- Official Implementation of Frequency-enhanced Data Augmentation for Vision-and-Language Navigation (NeurIPS2023)☆13Updated 10 months ago
- Official implementation of "Grounded Entity-Landmark Adaptive Pre-training for Vision-and-Language Navigation" (ICCV 2023 Oral)☆17Updated last year
- Official implementation of Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions (Ne…☆18Updated 3 weeks ago
- Fast-Slow Test-time Adaptation for Online Vision-and-Language Navigation☆18Updated last month
- [ECCV24] Navigation Instruction Generation with BEV Perception and Large Language Models☆24Updated 4 months ago
- ☆12Updated last year
- ☆40Updated last year
- ☆10Updated last month