MiliLab / GeoLLaVA-8K
Official repo for [NeurIPS 2025 Spotlight] "GeoLLaVA-8K: Scaling Remote-Sensing Multimodal Large Language Models to 8K Resolution"
☆34Updated last month
Alternatives and similar repositories for GeoLLaVA-8K
Users who are interested in GeoLLaVA-8K are comparing it to the repositories listed below
- A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understanding☆130Updated 4 months ago
- A Survey on Vision-Language Geo-Foundation Models (VLGFMs)☆174Updated 6 months ago
- ☆64Updated 6 months ago
- VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis☆106Updated 9 months ago
- ☆59Updated last month
- VGI-Enhanced multimodal large language model for remote sensing images.☆178Updated 9 months ago
- ☆30Updated 4 months ago
- [IEEE TGRS 2024 🔥] Change-Agent: Toward Interactive Comprehensive Remote Sensing Change Interpretation and Analysis☆165Updated 4 months ago
- Collection of Remote Sensing Vision-Language Models☆142Updated last year
- SegEarth-R1: Geospatial Pixel Reasoning via Large Language Model☆130Updated 3 months ago
- [AAAI 2025] This repo contains evaluation code for the paper “UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in…☆35Updated 7 months ago
- Paper List on Earth Observation in the Foundation Model Era☆27Updated this week
- Code and updates for the ScoreRS project.☆35Updated 2 months ago
- 🔥Remote Sensing SpatioTemporal Vision-Language Models: A Comprehensive Survey☆181Updated last month
- ☆43Updated 11 months ago
- [ISPRS2025] SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Model☆104Updated last week
- ☆125Updated 6 months ago
- [ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning☆42Updated 4 months ago
- [AAAI 2024] EarthVQA: Towards Queryable Earth via Relational Reasoning-Based Remote Sensing Visual Question Answering☆142Updated last month
- Paper list for LLM/MLLM-based image segmentation☆44Updated 3 weeks ago
- ☆36Updated last year
- ☆142Updated 11 months ago
- [GRSM] Project Page for "GeoPix: Multi-Modal Large Language Model for Pixel-level Image Understanding in Remote Sensing"☆59Updated 6 months ago
- Official code for TEOChat, the first vision-language assistant for temporal earth observation data (ICLR 2025).☆126Updated last week
- RS5M: a large-scale vision language dataset for remote sensing [TGRS]☆292Updated 8 months ago
- [CVPR 2025 Highlight] XLRS-Bench: Could Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery?☆48Updated last month
- [CVPR 2025 🔥] EarthDial: Turning Multi-Sensory Earth Observations to Interactive Dialogues.☆98Updated 5 months ago
- ☆71Updated 2 weeks ago
- Vision-Language Dataset for Remote Sensing☆40Updated 6 months ago
- [CVPR 2025 Oral] SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images☆222Updated 5 months ago