InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition (NeurIPS 2025)
☆107Feb 28, 2026Updated last month
Alternatives and similar repositories for InstructSAM
Users that are interested in InstructSAM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Vision-Language Dataset for Remote Sensing☆42May 27, 2025Updated 10 months ago
- Code and updates for the ScoreRS project.☆42Sep 19, 2025Updated 6 months ago
- [ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning☆50Feb 16, 2026Updated last month
- [NeurIPS 2025 D&B] RSCC: A Real-World Remote Sensing Change Caption Dataset☆48Feb 14, 2026Updated last month
- A Scene Graph-Enhanced Remote Sensing Large Vision-Language Model☆143Jan 19, 2026Updated 2 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- DescribeEarth: Describe Anything for Remote Sensing Images☆24Mar 6, 2026Updated last month
- Awesome Remote Sensing Vision-Language Datasets☆65Mar 17, 2026Updated 3 weeks ago
- Falcon: A Remote Sensing Vision-Language Foundation Model☆360Mar 12, 2026Updated last month
- Official repo for "SkyScript: A Large and Semantically Diverse Vision-Language Dataset for Remote Sensing"☆201Dec 10, 2024Updated last year
- ☆29Sep 2, 2025Updated 7 months ago
- [CVPR 2025] This is a model aggregated with CLIP and SAM version of SkySense for remote sensing interpretation described in SkySense-O: T…☆265Aug 27, 2025Updated 7 months ago
- Code & Dataset repository for the paper "Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation"☆85Nov 21, 2025Updated 4 months ago
- Official code for TEOChat, the first vision-language assistant for temporal earth observation data (ICLR 2025).☆141Dec 1, 2025Updated 4 months ago
- Accompanying repo for CVPRW'24: Charting New Territories: Exploring the Geographic and Geospatial Capabilities of Multimodal LLMs☆27May 24, 2025Updated 10 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Beyond the Visible: Multispectral Vision-Language Learning for Earth Observation☆49Feb 25, 2026Updated last month
- [AAAI 2026 Oral] DynamicEarth: How Far are We from Open-Vocabulary Change Detection?☆117Dec 23, 2025Updated 3 months ago
- This is official implementation of KCR.☆22Aug 17, 2023Updated 2 years ago
- RS-MTDF: Multi-Teacher Distillation and Fusion for Remote Sensing Semi-Supervised Semantic Segmentation☆20Jun 15, 2025Updated 9 months ago
- [ICLR 2026] The official implementation of the paper “Earth-Agent: Unlocking the Full Landscape of Earth Observation with Agents”☆123Apr 2, 2026Updated last week
- [AAAI'25] Official Code for “Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community"☆237Feb 18, 2026Updated last month
- [TPAMI2024] Learning to Holistically Detect Bridges from Large-Size VHR Remote Sensing Imagery☆15Mar 18, 2025Updated last year
- GAIA: A global, multimodal, multiscale vision–language dataset for remote sensing image analysis☆32Feb 11, 2026Updated 2 months ago
- RS5M: a large-scale vision language dataset for remote sensing [TGRS]☆304Mar 17, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ThinkGeo is a Comprehensive Benchmark to evaluate Tool-Augmented Agents for Remote Sensing Tasks☆63Apr 2, 2026Updated last week
- The official repo for [JSTARS'24] "MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining"☆248Aug 4, 2025Updated 8 months ago
- GeoPixel: A Pixel Grounding Large Multimodal Model for Remote Sensing is specifically developed for high-resolution remote sensing image …☆145May 28, 2025Updated 10 months ago
- Paper list for LLM/MLLM-based image segmentation☆46Dec 24, 2025Updated 3 months ago
- GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding☆84May 10, 2025Updated 11 months ago
- [CVPR 2025 Oral] SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images☆254Jul 9, 2025Updated 9 months ago
- Towards Robust Evaluation for Geospatial Foundation Models☆263Jul 17, 2025Updated 8 months ago
- [Nature Machine Intelligence 2025] This repository is the official implementation of the paper "A semantic-enhanced multi-modal remote se…☆204Sep 18, 2025Updated 6 months ago
- 🛰️ Official repository of paper "RemoteCLIP: A Vision Language Foundation Model for Remote Sensing" (IEEE TGRS)☆530Jun 27, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- [ICLR 2025] Official Pytorch Implementation of MMR: A Large-scale Benchmark Dataset for Multi-target and Multi-granularity Reasoning Segm…☆25Apr 3, 2025Updated last year
- [ACM MM 25] Official repo of "RemoteSAM: Towards Segment Anything for Earth Observation"☆218Jan 4, 2026Updated 3 months ago
- SegEarth-R1: Geospatial Pixel Reasoning via Large Language Model☆143Jan 21, 2026Updated 2 months ago
- A Survey on Vision-Language Geo-Foundation Models (VLGFMs)☆177May 24, 2025Updated 10 months ago
- VGI-Enhanced multimodal large language model for remote sensing images.☆186Mar 4, 2025Updated last year
- [ICLR'25] Official repo of "PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection"☆38Mar 27, 2025Updated last year
- [IEEE TPAMI 2025] REST: Holistic Learning for End-to-End Semantic Segmentation of Whole-Scene Remote Sensing Imagery☆38Mar 18, 2026Updated 3 weeks ago