VoyagerXvoyagerx / InstructSAMLinks
InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition (NeurIPS 2025)
☆105Updated last month
Alternatives and similar repositories for InstructSAM
Users that are interested in InstructSAM are comparing it to the libraries listed below
Sorting:
- SegEarth-R1: Geospatial Pixel Reasoning via Large Language Model☆136Updated 2 weeks ago
- GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding☆77Updated 8 months ago
- SegEarth-OV3: Exploring SAM 3 for Open-Vocabulary Semantic Segmentation in Remote Sensing Images☆116Updated this week
- [AAAI'25] Official Code for “Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community"☆223Updated 2 months ago
- [ACM MM 25] Official repo of "RemoteSAM: Towards Segment Anything for Earth Observation"☆197Updated last week
- A Scene Graph-Enhanced Remote Sensing Large Vision-Language Model☆133Updated this week
- [CVPR 2025 Oral] SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images☆232Updated 6 months ago
- This is a official code repository of ROS-SAM☆65Updated 9 months ago
- ☆44Updated last year
- The first large-scale multimodal dialogue dataset focusing on Synthetic Aperture Radar (SAR) imagery.☆65Updated 11 months ago
- [ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning☆44Updated 5 months ago
- [IEEE GRSM 2025 🔥] "Text2Earth: Unlocking Text-driven Remote Sensing Image Generation with a Global-Scale Dataset and a Foundation Model…☆159Updated this week
- [IGARSS 2025 Oral] A Simple Aerial Detection Baseline of Multimodal Language Models.☆88Updated 6 months ago
- This is the implement of the paper "DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding"☆84Updated 7 months ago
- Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation☆146Updated last year
- [TPAMI 2025] CrossEarth: Geospatial Vision Foundation Model for Cross-Domain Generalization in Remote Sensing Semantic Segmentation☆172Updated 3 weeks ago
- [CVPR 2025 🔥] EarthDial: Turning Multi-Sensory Earth Observations to Interactive Dialogues.☆106Updated 6 months ago
- Vision-Language Dataset for Remote Sensing☆39Updated 7 months ago
- [TPAMI] Oriented object detection on STAR dataset.☆87Updated 11 months ago
- [IJCV] PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection☆40Updated 3 months ago
- [TGRS 2025] Code for "PointSAM: Pointly-Supervised Segment Anything Model for Remote Sensing Images"☆60Updated 2 months ago
- ☆92Updated last month
- VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis☆108Updated 10 months ago
- ☆39Updated last year
- Paper list for LLM/MLLM-based image segmentation☆46Updated 3 weeks ago
- [GRSM] Project Page for "GeoPix: Multi-Modal Large Language Model for Pixel-level Image Understanding in Remote Sensing"☆61Updated 8 months ago
- EagleVision: Object-level Attribute Multimodal LLM for Remote Sensing☆19Updated 7 months ago
- ☆128Updated 11 months ago
- Code & Dataset repository for the paper "Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation"☆77Updated last month
- [ISPRS2025] SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Model☆116Updated last month