VoyagerXvoyagerx / InstructSAMLinks
InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition (NeurIPS 2025)
☆107Updated last week
Alternatives and similar repositories for InstructSAM
Users that are interested in InstructSAM are comparing it to the libraries listed below
Sorting:
- SegEarth-OV3: Exploring SAM 3 for Open-Vocabulary Semantic Segmentation in Remote Sensing Images☆136Updated 3 weeks ago
- GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding☆78Updated 8 months ago
- SegEarth-R1: Geospatial Pixel Reasoning via Large Language Model☆140Updated 2 weeks ago
- [AAAI'25] Official Code for “Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community"☆225Updated 3 months ago
- [ACM MM 25] Official repo of "RemoteSAM: Towards Segment Anything for Earth Observation"☆204Updated last month
- This is a official code repository of ROS-SAM☆66Updated 9 months ago
- [TPAMI 2025] CrossEarth: Geospatial Vision Foundation Model for Cross-Domain Generalization in Remote Sensing Semantic Segmentation☆175Updated last month
- A Scene Graph-Enhanced Remote Sensing Large Vision-Language Model☆134Updated 2 weeks ago
- [CVPR 2025 Oral] SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images☆240Updated 6 months ago
- [IEEE GRSM 2025 🔥] "Text2Earth: Unlocking Text-driven Remote Sensing Image Generation with a Global-Scale Dataset and a Foundation Model…☆162Updated 3 weeks ago
- VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis☆109Updated 11 months ago
- This is the implement of the paper "DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding"☆84Updated last week
- ☆41Updated last year
- ☆44Updated last year
- [ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning☆46Updated 6 months ago
- The first large-scale multimodal dialogue dataset focusing on Synthetic Aperture Radar (SAR) imagery.☆65Updated 11 months ago
- Vision-Language Dataset for Remote Sensing☆40Updated 8 months ago
- [ICLR 2026] The official implementation of the paper “Earth-Agent: Unlocking the Full Landscape of Earth Observation with Agents”☆93Updated this week
- ☆66Updated last month
- Code and updates for the ScoreRS project.☆39Updated 4 months ago
- [TGRS 2025] Code for "PointSAM: Pointly-Supervised Segment Anything Model for Remote Sensing Images"☆62Updated 3 months ago
- EagleVision: Object-level Attribute Multimodal LLM for Remote Sensing☆20Updated 8 months ago
- [ISPRS2025] SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Model☆119Updated 2 months ago
- Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation☆150Updated last year
- ☆130Updated last year
- [GRSM] Project Page for "GeoPix: Multi-Modal Large Language Model for Pixel-level Image Understanding in Remote Sensing"☆63Updated 8 months ago
- [AAAI 2024] EarthVQA: Towards Queryable Earth via Relational Reasoning-Based Remote Sensing Visual Question Answering☆152Updated last week
- ☆144Updated 3 weeks ago
- [AAAI 2026 Oral] DynamicEarth: How Far are We from Open-Vocabulary Change Detection?☆107Updated last month
- [IGARSS 2025 Oral] A Simple Aerial Detection Baseline of Multimodal Language Models.☆90Updated 2 weeks ago