MiliLab / GeoLLaVA-8KLinks
Official repo for [NeurlPS 2025 Spotlight]  "GeoLLaVA-8K: Scaling Remote-Sensing Multimodal Large Language Models to 8K Resolution"
☆29Updated 3 weeks ago
Alternatives and similar repositories for GeoLLaVA-8K
Users that are interested in GeoLLaVA-8K are comparing it to the libraries listed below
Sorting:
- A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understanding☆125Updated 2 months ago
- VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis☆103Updated 8 months ago
- A Survey on Vision-Language Geo-Foundation Models (VLGFMs)☆174Updated 5 months ago
- ☆60Updated 5 months ago
- SegEarth-R1: Geospatial Pixel Reasoning via Large Language Model☆122Updated 2 months ago
- ☆56Updated last week
- Paper List on Earth Observation in the Foundation Model Era☆26Updated 3 weeks ago
- [ISPRS2025] SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Model☆100Updated last month
- Official repo for "Foundation Models for Remote Sensing and Earth Observation: A Survey"☆47Updated 11 months ago
- ☆25Updated 3 months ago
- ☆123Updated 9 months ago
- VGI-Enhanced multimodal large language model for remote sensing images.☆170Updated 7 months ago
- [Nature Machine Intelligence 2025] This repository is the official implementation of the paper "A semantic-enhanced multi-modal remote se…☆121Updated last month
- ☆118Updated 4 months ago
- Code & Dataset repository for the paper "Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation"☆66Updated 9 months ago
- [TGRS 2025] Code for "PointSAM: Pointly-Supervised Segment Anything Model for Remote Sensing Images"☆54Updated 4 months ago
- RS-Agent: An LLM-driven Remote Sensing Intelligent Agent☆45Updated 5 months ago
- [AAAI 2024] EarthVQA: Towards Queryable Earth via Relational Reasoning-Based Remote Sensing Visual Question Answering☆137Updated 2 weeks ago
- [ACMMM-25] Official repo of "RemoteSAM: Towards Segment Anything for Earth Observation"☆178Updated last month
- Paper list for LLM/MLLM-based image segmentation☆35Updated this week
- [CVPR 2025] Towards Satellite Image Road Graph Extraction: A Global-Scale Dataset and A Novel Method☆63Updated 7 months ago
- DynamicEarth: How Far are We from Open-Vocabulary Change Detection?☆87Updated 5 months ago
- Collection of Remote Sensing Vision-Language Models☆141Updated last year
- [CVPR 2025 🔥] EarthDial: Turning Multi-Sensory Earth Observations to Interactive Dialogues.☆87Updated 4 months ago
- GeoPixel: A Pixel Grounding Large Multimodal Model for Remote Sensing is specifically developed for high-resolution remote sensing image …☆124Updated 5 months ago
- 🔥Remote Sensing SpatioTemporal Vision-Language Models: A Comprehensive Survey☆166Updated last month
- ☆42Updated 9 months ago
- ☆119Updated 3 months ago
- Official code for TEOChat, the first vision-language assistant for temporal earth observation data (ICLR 2025).☆119Updated 5 months ago
- 🔥🔥First Multi-granularity, multi-sensor, multi-scale LMM for Earth Observation.☆82Updated 2 weeks ago