opendatalab / VHMLinks
VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis
☆92Updated 6 months ago
Alternatives and similar repositories for VHM
Users that are interested in VHM are comparing it to the libraries listed below
Sorting:
- A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understanding☆118Updated 3 weeks ago
- ☆56Updated 3 months ago
- ☆53Updated last year
- ☆41Updated 7 months ago
- GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding☆57Updated 3 months ago
- ☆32Updated 8 months ago
- A Survey on Vision-Language Geo-Foundation Models (VLGFMs)☆172Updated 3 months ago
- SegEarth-R1: Geospatial Pixel Reasoning via Large Language Model☆105Updated 3 weeks ago
- [ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning☆33Updated 3 weeks ago
- ☆129Updated 8 months ago
- [ISPRS2025] SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Model☆93Updated 3 months ago
- ☆121Updated 7 months ago
- VGI-Enhanced multimodal large language model for remote sensing images.☆166Updated 5 months ago
- Collection of Remote Sensing Vision-Language Models☆139Updated last year
- InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition☆64Updated 3 months ago
- Vision-Language Dataset for Remote Sensing☆35Updated 3 months ago
- Code and updates for the ScoreRS project.☆27Updated 5 months ago
- The first large-scale multimodal dialogue dataset focusing on Synthetic Aperture Radar (SAR) imagery.☆60Updated 6 months ago
- Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation☆131Updated last year
- [AAAI 2025]This repo contains evaluation code for the paper “UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in…☆34Updated 4 months ago
- This is the implement of the paper "DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding"☆66Updated 2 months ago
- ☆36Updated last year
- ☆113Updated 2 months ago
- [TGRS 2024] Language-Guided Progressive Attention for Visual Grounding in Remote Sensing Images.☆40Updated 2 months ago
- RS5M: a large-scale vision language dataset for remote sensing [TGRS]☆277Updated 5 months ago
- [IEEE GRSM 2025 🔥] "Text2Earth: Unlocking Text-driven Remote Sensing Image Generation with a Global-Scale Dataset and a Foundation Model…☆129Updated 2 months ago
- GeoPixel: A Pixel Grounding Large Multimodal Model for Remote Sensing is specifically developed for high-resolution remote sensing image …☆111Updated 3 months ago
- Paper list for LLM/MLLM-based image segmentation☆28Updated 2 weeks ago
- ☆117Updated last month
- Official code for TEOChat, the first vision-language assistant for temporal earth observation data (ICLR 2025).☆115Updated 2 months ago