InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition (NeurIPS 2025)
☆108Feb 28, 2026Updated 2 months ago
Alternatives and similar repositories for InstructSAM
Users that are interested in InstructSAM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Vision-Language Dataset for Remote Sensing☆43May 27, 2025Updated 11 months ago
- Code and updates for the ScoreRS project.☆42Sep 19, 2025Updated 8 months ago
- [ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning☆51Feb 16, 2026Updated 3 months ago
- [NeurIPS 2025 D&B] RSCC: A Real-World Remote Sensing Change Caption Dataset☆49Feb 14, 2026Updated 3 months ago
- Official repo for "SkyScript: A Large and Semantically Diverse Vision-Language Dataset for Remote Sensing"☆203Dec 10, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A Scene Graph-Enhanced Remote Sensing Large Vision-Language Model☆143Jan 19, 2026Updated 4 months ago
- [CVPR 2025] This is a model aggregated with CLIP and SAM version of SkySense for remote sensing interpretation described in SkySense-O: T…☆268Aug 27, 2025Updated 8 months ago
- DescribeEarth: Describe Anything for Remote Sensing Images☆26Mar 6, 2026Updated 2 months ago
- Falcon: A Remote Sensing Vision-Language Foundation Model☆373Mar 12, 2026Updated 2 months ago
- ☆29Sep 2, 2025Updated 8 months ago
- Awesome Remote Sensing Vision-Language Datasets☆76May 12, 2026Updated last week
- Official code for TEOChat, the first vision-language assistant for temporal earth observation data (ICLR 2025).☆142Dec 1, 2025Updated 5 months ago
- Code & Dataset repository for the paper "Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation"☆90Nov 21, 2025Updated 6 months ago
- RS5M: a large-scale vision language dataset for remote sensing [TGRS]☆311Mar 17, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Accompanying repo for CVPRW'24: Charting New Territories: Exploring the Geographic and Geospatial Capabilities of Multimodal LLMs☆27May 24, 2025Updated 11 months ago
- Beyond the Visible: Multispectral Vision-Language Learning for Earth Observation☆50Feb 25, 2026Updated 2 months ago
- The official repo for [JSTARS'24] "MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining"☆249Aug 4, 2025Updated 9 months ago
- [AAAI 2026 Oral] DynamicEarth: How Far are We from Open-Vocabulary Change Detection?☆124Dec 23, 2025Updated 5 months ago
- RS-MTDF: Multi-Teacher Distillation and Fusion for Remote Sensing Semi-Supervised Semantic Segmentation☆20Jun 15, 2025Updated 11 months ago
- This is official implementation of KCR.☆22Aug 17, 2023Updated 2 years ago
- [AAAI'25] Official Code for “Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community"☆238Feb 18, 2026Updated 3 months ago
- [ICLR 2026] The official implementation of the paper “Earth-Agent: Unlocking the Full Landscape of Earth Observation with Agents”☆145Apr 2, 2026Updated last month
- SegEarth-R1: Geospatial Pixel Reasoning via Large Language Model☆146Jan 21, 2026Updated 4 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [TPAMI2024] Learning to Holistically Detect Bridges from Large-Size VHR Remote Sensing Imagery☆15Mar 18, 2025Updated last year
- GAIA: A global, multimodal, multiscale vision–language dataset for remote sensing image analysis☆33Feb 11, 2026Updated 3 months ago
- ThinkGeo is a Comprehensive Benchmark to evaluate Tool-Augmented Agents for Remote Sensing Tasks☆67Apr 2, 2026Updated last month
- 🛰️ Official repository of paper "RemoteCLIP: A Vision Language Foundation Model for Remote Sensing" (IEEE TGRS)☆550Jun 27, 2024Updated last year
- GeoPixel: A Pixel Grounding Large Multimodal Model for Remote Sensing is specifically developed for high-resolution remote sensing image …☆144May 11, 2026Updated last week
- Paper list for LLM/MLLM-based image segmentation☆47Dec 24, 2025Updated 4 months ago
- GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding☆88May 10, 2025Updated last year
- [CVPR 2025 Oral] SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images☆262Jul 9, 2025Updated 10 months ago
- Towards Robust Evaluation for Geospatial Foundation Models☆269Jul 17, 2025Updated 10 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [Nature Machine Intelligence 2025] This repository is the official implementation of the paper "A semantic-enhanced multi-modal remote se…☆220Sep 18, 2025Updated 8 months ago
- [ICLR 2025] Official Pytorch Implementation of MMR: A Large-scale Benchmark Dataset for Multi-target and Multi-granularity Reasoning Segm…☆26Apr 3, 2025Updated last year
- The official repo for [NeurIPS'23] "SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model"☆383Aug 5, 2024Updated last year
- [ACM MM 25] Official repo of "RemoteSAM: Towards Segment Anything for Earth Observation"☆233Jan 4, 2026Updated 4 months ago
- A Survey on Vision-Language Geo-Foundation Models (VLGFMs)☆179May 24, 2025Updated 11 months ago
- VGI-Enhanced multimodal large language model for remote sensing images.☆189Mar 4, 2025Updated last year
- [ICLR'25] Official repo of "PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection"☆38Mar 27, 2025Updated last year