InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition (NeurIPS 2025)
☆107Jan 25, 2026Updated last month
Alternatives and similar repositories for InstructSAM
Users who are interested in InstructSAM are comparing it to the repositories listed below
- Code and updates for the ScoreRS project.☆40Sep 19, 2025Updated 5 months ago
- Vision-Language Dataset for Remote Sensing☆40May 27, 2025Updated 9 months ago
- [ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning☆47Feb 16, 2026Updated 2 weeks ago
- Awesome Remote Sensing Vision-Language Datasets☆46Feb 24, 2026Updated last week
- A Scene Graph-Enhanced Remote Sensing Large Vision-Language Model☆138Jan 19, 2026Updated last month
- Official repo for "SkyScript: A Large and Semantically Diverse Vision-Language Dataset for Remote Sensing"☆198Dec 10, 2024Updated last year
- DescribeEarth: Describe Anything for Remote Sensing Images☆23Feb 24, 2026Updated last week
- This is the official implementation of KCR.☆22Aug 17, 2023Updated 2 years ago
- GAIA: A global, multimodal, multiscale vision–language dataset for remote sensing image analysis☆31Feb 11, 2026Updated 2 weeks ago
- [NeurIPS 2025 D&B] RSCC: A Real-World Remote Sensing Change Caption Dataset☆43Feb 14, 2026Updated 2 weeks ago
- Falcon: A Remote Sensing Vision-Language Foundation Model☆359Apr 10, 2025Updated 10 months ago
- ☆28Sep 2, 2025Updated 6 months ago
- [TPAMI2024] Learning to Holistically Detect Bridges from Large-Size VHR Remote Sensing Imagery☆14Mar 18, 2025Updated 11 months ago
- RS-MTDF: Multi-Teacher Distillation and Fusion for Remote Sensing Semi-Supervised Semantic Segmentation☆19Jun 15, 2025Updated 8 months ago
- Accompanying repo for CVPRW'24: Charting New Territories: Exploring the Geographic and Geospatial Capabilities of Multimodal LLMs☆27May 24, 2025Updated 9 months ago
- [CVPR 2025] This is a model aggregated with CLIP and SAM version of SkySense for remote sensing interpretation described in SkySense-O: T…☆259Aug 27, 2025Updated 6 months ago
- [AAAI 2026 Oral] DynamicEarth: How Far are We from Open-Vocabulary Change Detection?☆109Dec 23, 2025Updated 2 months ago
- [AAAI'25] Official Code for "Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community"☆228Feb 18, 2026Updated last week
- [IEEE TPAMI 2025] REST: Holistic Learning for End-to-End Semantic Segmentation of Whole-Scene Remote Sensing Imagery☆35Sep 19, 2025Updated 5 months ago
- Beyond the Visible: Multispectral Vision-Language Learning for Earth Observation☆46Updated this week
- [ACM MM 25] Official repo of "RemoteSAM: Towards Segment Anything for Earth Observation"☆207Jan 4, 2026Updated last month
- GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding☆79May 10, 2025Updated 9 months ago
- Official code for TEOChat, the first vision-language assistant for temporal earth observation data (ICLR 2025).☆135Dec 1, 2025Updated 3 months ago
- Paper list for LLM/MLLM-based image segmentation☆47Dec 24, 2025Updated 2 months ago
- GeoPixel: A Pixel Grounding Large Multimodal Model for Remote Sensing is specifically developed for high-resolution remote sensing image …☆143May 28, 2025Updated 9 months ago
- Code & Dataset repository for the paper "Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation"☆84Nov 21, 2025Updated 3 months ago
- RS5M: a large-scale vision language dataset for remote sensing [TGRS]☆298Mar 17, 2025Updated 11 months ago
- [ICLR'25] Official repo of "PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection"☆38Mar 27, 2025Updated 11 months ago
- [Nature Machine Intelligence 2025] This repository is the official implementation of the paper "A semantic-enhanced multi-modal remote se…☆189Sep 18, 2025Updated 5 months ago
- The official repo for [JSTARS'24] "MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining"☆247Aug 4, 2025Updated 6 months ago
- Towards Robust Evaluation for Geospatial Foundation Models☆255Jul 17, 2025Updated 7 months ago
- [CVPR 2025 Oral] SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images☆245Jul 9, 2025Updated 7 months ago
- VGI-Enhanced multimodal large language model for remote sensing images.☆182Mar 4, 2025Updated 11 months ago
- A Survey on Vision-Language Geo-Foundation Models (VLGFMs)☆177May 24, 2025Updated 9 months ago
- A collection of papers related to Geo-spatial Information Science in CVPR 2025.☆39Apr 1, 2025Updated 11 months ago
- ThinkGeo is a Comprehensive Benchmark to evaluate Tool-Augmented Agents for Remote Sensing Tasks☆59Feb 20, 2026Updated last week
- SegEarth-R1: Geospatial Pixel Reasoning via Large Language Model☆141Jan 21, 2026Updated last month
- 🛰️ Official repository of paper "RemoteCLIP: A Vision Language Foundation Model for Remote Sensing" (IEEE TGRS)☆519Jun 27, 2024Updated last year
- [ICLR 2026] The official implementation of the paper “Earth-Agent: Unlocking the Full Landscape of Earth Observation with Agents”☆97Feb 1, 2026Updated last month