VisionXLab / AirSpatialBotLinks
[TGRS'25] AirSpatialBot: A Spatially-Aware Aerial Agent for Fine-Grained Vehicle Attribute Recognization and Retrieval
☆23Updated 3 months ago
Alternatives and similar repositories for AirSpatialBot
Users that are interested in AirSpatialBot are comparing it to the libraries listed below
Sorting:
- ☆43Updated 10 months ago
- ☆36Updated last year
- Code & Dataset repository for the paper "Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation"☆72Updated last week
- EagleVision: Object-level Attribute Multimodal LLM for Remote Sensing☆19Updated 6 months ago
- ☆19Updated last year
- Awesome Referring Remote Sensing Image Segmentation☆22Updated 2 months ago
- Vision-Language Dataset for Remote Sensing☆40Updated 6 months ago
- Code and updates for the ScoreRS project.☆34Updated 2 months ago
- SegEarth-R1: Geospatial Pixel Reasoning via Large Language Model☆129Updated 3 months ago
- [TGRS 2024] Language-Guided Progressive Attention for Visual Grounding in Remote Sensing Images.☆47Updated 5 months ago
- ☆19Updated last year
- ☆64Updated 6 months ago
- [AAAI 2025] Official PyTorch implementation of "ZoRI: Towards Discriminative Zero-Shot Remote Sensing Instance Segmentation"☆36Updated 3 months ago
- Paper List on Earth Observation in the Foundation Model Era☆27Updated last week
- [CVPR 2025 Highlight] Change3D: Revisiting Change Detection and Captioning from A Video Modeling Perspective.☆59Updated 4 months ago
- RS-MTDF: Multi-Teacher Distillation and Fusion for Remote Sensing Semi-Supervised Semantic Segmentation☆19Updated 5 months ago
- Paper list for LLM/MLLM-based image segmentation☆44Updated 2 weeks ago
- ☆90Updated 9 months ago
- This is the implement of the paper "RSRefSeg 2: Decoupling Referring Remote Sensing Image Segmentation with Foundation Models"☆20Updated 4 months ago
- [AAAI 2026 Oral] DynamicEarth: How Far are We from Open-Vocabulary Change Detection?☆97Updated 6 months ago
- This is the pytorch implement of the paper "RSRefSeg: Referring Remote Sensing Image Segmentation with Foundation Models"☆65Updated 4 months ago
- ☆58Updated last month
- The first large-scale multimodal dialogue dataset focusing on Synthetic Aperture Radar (SAR) imagery.☆64Updated 9 months ago
- [ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning☆39Updated 4 months ago
- [ISPRS2025] SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Model☆104Updated 2 months ago
- ☆63Updated 10 months ago
- DOFA-CLIP: Multimodal Vision–Language Foundation Models for Earth Observation☆30Updated 4 months ago
- ☆16Updated last year
- [TGRS 2024] Co-training Transformer for Remote Sensing Image Classification, Segmentation and Detection.☆45Updated 8 months ago
- A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understanding☆128Updated 3 months ago