1e12Leon / AirNavigationLinks
[AAAI2026 demo] Official repo of “AirNavigation: Let UAV Navigation Tells Its Own Story”
☆15Updated 2 months ago
Alternatives and similar repositories for AirNavigation
Users that are interested in AirNavigation are comparing it to the libraries listed below
Sorting:
- [ACM MM 25] Official repo of "UEMM-Air: Enable UAVs to Undertake More Multi-modal Tasks"☆31Updated 4 months ago
- An offical repo for ECCV 2024 Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching☆108Updated 11 months ago
- [AAAI2025 selected as oral] - Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints☆43Updated 6 months ago
- [CVPR 2024🔥] Unleashing Unlabeled Data: A Paradigm for Cross-View Geo-Localization☆111Updated last year
- RefDrone: A Challenging Benchmark for Drone Scene Referring Expression Comprehension☆29Updated 2 weeks ago
- A codebase for flexible and efficient Image Text Representation Alignment☆20Updated 2 years ago
- [AAAI 2025]This repo contains evaluation code for the paper “UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in…☆36Updated 9 months ago
- [GRSM] Project Page for "GeoPix: Multi-Modal Large Language Model for Pixel-level Image Understanding in Remote Sensing"☆61Updated 8 months ago
- [ICCV2025] PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination☆32Updated 2 months ago
- A Survey on Vision-Language Geo-Foundation Models (VLGFMs)☆176Updated 7 months ago
- [ICCV 2025] Where am I? Cross-View Geo-localization with Natural Language Descriptions.☆55Updated last month
- [TGRS 2024] Language-Guided Progressive Attention for Visual Grounding in Remote Sensing Images.☆50Updated 7 months ago
- Visual Grounding with Multi-modal Conditional Adaptation (ACMMM 2024 Oral)☆26Updated 7 months ago
- ☆44Updated last year
- ☆66Updated last month
- ☆129Updated 7 months ago
- [CVPR 2025 Oral] SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images☆230Updated 6 months ago
- ☆27Updated last month
- [AAAI'25] Official Code for “Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community"☆222Updated 2 months ago
- [CVPR 2024] Official implementation of "VRP-SAM: SAM with Visual Reference Prompt"☆17Updated last month
- UMB: Understanding Model Behavior for Open-World object Detection (NeurIPS 2024)☆11Updated last year
- [ICLR2025] Text4Seg: Reimagining Image Segmentation as Text Generation☆157Updated 2 months ago
- ☆28Updated last year
- Official implementation of the ICCV 2025 paper HoliTracer.☆36Updated last month
- GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding☆77Updated 8 months ago
- [AAAI 26] Official PyTorch implementation of Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency Adaptation☆50Updated 7 months ago
- [NeurIPS2024] - SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion☆100Updated 2 months ago
- [ACM MM 25] Official repo of "RemoteSAM: Towards Segment Anything for Earth Observation"☆197Updated last week
- [ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning☆44Updated 5 months ago
- The first large-scale multimodal dialogue dataset focusing on Synthetic Aperture Radar (SAR) imagery.☆65Updated 10 months ago