InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition (NeurIPS 2025)
☆108Feb 28, 2026Updated 3 weeks ago
Alternatives and similar repositories for InstructSAM
Users that are interested in InstructSAM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Vision-Language Dataset for Remote Sensing☆41May 27, 2025Updated 9 months ago
- Code and updates for the ScoreRS project.☆42Sep 19, 2025Updated 6 months ago
- [ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning☆49Feb 16, 2026Updated last month
- [NeurIPS 2025 D&B] RSCC: A Real-World Remote Sensing Change Caption Dataset☆44Feb 14, 2026Updated last month
- Awesome Remote Sensing Vision-Language Datasets☆61Updated this week
- A Scene Graph-Enhanced Remote Sensing Large Vision-Language Model☆140Jan 19, 2026Updated 2 months ago
- DescribeEarth: Describe Anything for Remote Sensing Images☆24Mar 6, 2026Updated 2 weeks ago
- Falcon: A Remote Sensing Vision-Language Foundation Model☆358Mar 12, 2026Updated last week
- Official repo for "SkyScript: A Large and Semantically Diverse Vision-Language Dataset for Remote Sensing"☆200Dec 10, 2024Updated last year
- ☆29Sep 2, 2025Updated 6 months ago
- [CVPR 2025] This is a model aggregated with CLIP and SAM version of SkySense for remote sensing interpretation described in SkySense-O: T…☆264Aug 27, 2025Updated 6 months ago
- Official code for TEOChat, the first vision-language assistant for temporal earth observation data (ICLR 2025).☆138Dec 1, 2025Updated 3 months ago
- Code & Dataset repository for the paper "Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation"☆85Nov 21, 2025Updated 4 months ago
- Accompanying repo for CVPRW'24: Charting New Territories: Exploring the Geographic and Geospatial Capabilities of Multimodal LLMs☆27May 24, 2025Updated 9 months ago
- Beyond the Visible: Multispectral Vision-Language Learning for Earth Observation☆47Feb 25, 2026Updated 3 weeks ago
- [AAAI 2026 Oral] DynamicEarth: How Far are We from Open-Vocabulary Change Detection?☆112Dec 23, 2025Updated 3 months ago
- [ICLR 2026] The official implementation of the paper “Earth-Agent: Unlocking the Full Landscape of Earth Observation with Agents”☆110Feb 1, 2026Updated last month
- RS-MTDF: Multi-Teacher Distillation and Fusion for Remote Sensing Semi-Supervised Semantic Segmentation☆20Jun 15, 2025Updated 9 months ago
- This is official implementation of KCR.☆22Aug 17, 2023Updated 2 years ago
- [AAAI'25] Official Code for “Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community"☆231Feb 18, 2026Updated last month
- OpenEarthAgent is a unified framework for tool-augmented geospatial agents.☆44Mar 8, 2026Updated 2 weeks ago
- [TPAMI2024] Learning to Holistically Detect Bridges from Large-Size VHR Remote Sensing Imagery☆15Mar 18, 2025Updated last year
- RS5M: a large-scale vision language dataset for remote sensing [TGRS]☆301Mar 17, 2025Updated last year
- GAIA: A global, multimodal, multiscale vision–language dataset for remote sensing image analysis☆31Feb 11, 2026Updated last month
- ThinkGeo is a Comprehensive Benchmark to evaluate Tool-Augmented Agents for Remote Sensing Tasks☆61Feb 20, 2026Updated last month
- GeoPixel: A Pixel Grounding Large Multimodal Model for Remote Sensing is specifically developed for high-resolution remote sensing image …☆143May 28, 2025Updated 9 months ago
- The official repo for [JSTARS'24] "MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining"☆247Aug 4, 2025Updated 7 months ago
- Paper list for LLM/MLLM-based image segmentation☆47Dec 24, 2025Updated 2 months ago
- GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding☆80May 10, 2025Updated 10 months ago
- [CVPR 2025 Oral] SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images☆250Jul 9, 2025Updated 8 months ago
- Towards Robust Evaluation for Geospatial Foundation Models☆259Jul 17, 2025Updated 8 months ago
- [Nature Machine Intelligence 2025] This repository is the official implementation of the paper "A semantic-enhanced multi-modal remote se …☆197Sep 18, 2025Updated 6 months ago
- 🛰️ Official repository of paper "RemoteCLIP: A Vision Language Foundation Model for Remote Sensing" (IEEE TGRS)☆525Jun 27, 2024Updated last year
- [ICLR 2025] Official Pytorch Implementation of MMR: A Large-scale Benchmark Dataset for Multi-target and Multi-granularity Reasoning Segm…☆24Apr 3, 2025Updated 11 months ago
- [IEEE TPAMI 2025] REST: Holistic Learning for End-to-End Semantic Segmentation of Whole-Scene Remote Sensing Imagery☆36Updated this week
- [ACM MM 25] Official repo of "RemoteSAM: Towards Segment Anything for Earth Observation"☆216Jan 4, 2026Updated 2 months ago
- SegEarth-R1: Geospatial Pixel Reasoning via Large Language Model☆143Jan 21, 2026Updated 2 months ago
- A Survey on Vision-Language Geo-Foundation Models (VLGFMs)☆177May 24, 2025Updated 9 months ago
- [ICLR 2025] SAMRefiner: Taming Segment Anything Model for Universal Mask Refinement☆88Apr 19, 2025Updated 11 months ago