buaa-colalab / UAV-Flow
☆30 Updated last month
Alternatives and similar repositories for UAV-Flow
Users who are interested in UAV-Flow are comparing it to the libraries listed below.
- ☆71 Updated last month
- [ECCV24] Navigation Instruction Generation with BEV Perception and Large Language Models ☆31 Updated 11 months ago
- [AAAI-25 Oral] Official Implementation of "FLAME: Learning to Navigate with Multimodal LLM in Urban Environments" ☆57 Updated 4 months ago
- Official implementation for "HA-VLN: A Benchmark for Human-Aware Navigation in Discrete-Continuous Environments with Dynamic Multi-Human … ☆30 Updated last month
- [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding ☆107 Updated last month
- [CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding ☆131 Updated 2 months ago
- ☆132 Updated last week
- [CVPR2025] CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos ☆102 Updated 2 months ago
- ☆18 Updated 8 months ago
- ☆41 Updated 3 weeks ago
- [NeurIPS 2024] Official code repository for MSR3D paper ☆60 Updated last week
- [CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding ☆54 Updated 10 months ago
- ☆14 Updated last month
- 4D Panoptic Scene Graph Generation (NeurIPS'23 Spotlight) ☆109 Updated 3 months ago
- ☆37 Updated 2 weeks ago
- [CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation' ☆42 Updated last year
- [CVPR 2025] Source codes for the paper "3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning" ☆130 Updated 2 weeks ago
- ☆49 Updated 8 months ago
- An official repo for ECCV 2024 Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching ☆87 Updated 5 months ago
- ☆13 Updated last year
- Unifying 2D and 3D Vision-Language Understanding ☆86 Updated 2 months ago
- [ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities ☆74 Updated 8 months ago
- Unified Vision-Language-Action Model ☆61 Updated this week
- Official implementation of Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation (CoRL'24). ☆67 Updated 3 months ago
- [IROS'25 Oral] WMNav: Integrating Vision-Language Models into World Models for Object Goal Navigation ☆80 Updated last week
- [ECCV 2024] TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes ☆126 Updated 3 months ago
- Repository for Vision-and-Language Navigation via Causal Learning (Accepted by CVPR 2024) ☆75 Updated 3 weeks ago
- Segment Anything with Deictic Prompting ☆26 Updated last month
- ☆29 Updated 7 months ago
- ☆21 Updated 5 months ago