opendatalab / VHMLinks
VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis
☆98Updated 6 months ago
Alternatives and similar repositories for VHM
Users that are interested in VHM are comparing it to the libraries listed below
Sorting:
- A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understanding☆120Updated last month
- ☆57Updated 4 months ago
- ☆32Updated 9 months ago
- ☆54Updated last year
- GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding☆67Updated 4 months ago
- [ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning☆34Updated last month
- [ISPRS2025] SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Model☆95Updated last week
- A Survey on Vision-Language Geo-Foundation Models (VLGFMs)☆174Updated 3 months ago
- ☆41Updated 8 months ago
- SegEarth-R1: Geospatial Pixel Reasoning via Large Language Model☆111Updated 3 weeks ago
- Collection of Remote Sensing Vision-Language Models☆139Updated last year
- ☆133Updated 9 months ago
- RS5M: a large-scale vision language dataset for remote sensing [TGRS]☆281Updated 6 months ago
- Code and updates for the ScoreRS project.☆27Updated 6 months ago
- The first large-scale multimodal dialogue dataset focusing on Synthetic Aperture Radar (SAR) imagery.☆59Updated 7 months ago
- VGI-Enhanced multimodal large language model for remote sensing images.☆168Updated 6 months ago
- InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition☆65Updated 3 months ago
- Official repo for "GeoLLaVA-8K: Scaling Remote-Sensing Multimodal Large Language Models to 8K Resolution"☆22Updated 2 months ago
- Vision-Language Dataset for Remote Sensing☆35Updated 3 months ago
- Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation☆136Updated last year
- ☆122Updated 7 months ago
- ☆36Updated last year
- [IGARSS 2025 Oral] A Simple Aerial Detection Baseline of Multimodal Language Models.☆84Updated 2 months ago
- GeoPixel: A Pixel Grounding Large Multimodal Model for Remote Sensing is specifically developed for high-resolution remote sensing image …☆114Updated 3 months ago
- [TGRS 2024] Language-Guided Progressive Attention for Visual Grounding in Remote Sensing Images.☆40Updated 3 months ago
- [IEEE GRSM 2025 🔥] "Text2Earth: Unlocking Text-driven Remote Sensing Image Generation with a Global-Scale Dataset and a Foundation Model…☆133Updated 3 months ago
- RSVG: Exploring Data and Model for Visual Grounding on Remote Sensing Data, 2022☆151Updated last year
- Paper list for LLM/MLLM-based image segmentation☆30Updated 2 weeks ago
- Code & Dataset repository for the paper "Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation"☆62Updated 8 months ago
- ☆118Updated last month