shuyansy / EarthMindLinks
π₯π₯First Multi-granularity, multi-sensor, multi-scale LMM for Earth Observation.
β77Updated 4 months ago
Alternatives and similar repositories for EarthMind
Users that are interested in EarthMind are comparing it to the libraries listed below
Sorting:
- [CVPR 2025 π₯] EarthDial: Turning Multi-Sensory Earth Observations to Interactive Dialogues.β83Updated 3 months ago
- GeoPixel: A Pixel Grounding Large Multimodal Model for Remote Sensing is specifically developed for high-resolution remote sensing image β¦β121Updated 4 months ago
- [Official Repo] CrossEarth: Geospatial Vision Foundation Model for Cross-Domain Generalization in Remote Sensing Semantic Segmentationβ158Updated 9 months ago
- A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understandingβ120Updated 2 months ago
- SegEarth-R1: Geospatial Pixel Reasoning via Large Language Modelβ116Updated last month
- VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysisβ102Updated 7 months ago
- β58Updated 4 months ago
- β45Updated 5 months ago
- Official repo for "Foundation Models for Remote Sensing and Earth Observation: A Survey"β47Updated 10 months ago
- A collection of papers related to Geo-spatial Information Science in NeurIPS 2024.β54Updated 9 months ago
- A Survey on Vision-Language Geo-Foundation Models (VLGFMs)β174Updated 4 months ago
- ThinkGeo is a Comprehensive Benchmark to evaluate Tool-Augmented Agents for Remote Sensing Tasksβ43Updated 2 months ago
- Vision-Language Dataset for Remote Sensingβ38Updated 4 months ago
- β40Updated last year
- [ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruningβ35Updated 2 months ago
- [arXiv, 2024] Show Me What and Where has Changed? Question Answering and Grounding for Remote Sensing Change Detectionβ30Updated 3 months ago
- Official code for TEOChat, the first vision-language assistant for temporal earth observation data (ICLR 2025).β118Updated 4 months ago
- The first large-scale multimodal dialogue dataset focusing on Synthetic Aperture Radar (SAR) imagery.β59Updated 7 months ago
- Official implementation of the RSE paper mKGR.β18Updated last month
- β16Updated 3 weeks ago
- β33Updated 10 months ago
- A PyTorch implementation of "GeoSynth: Contextually-Aware High-Resolution Satellite Image Synthesis"β98Updated 10 months ago
- Landsat-Bench: Datasets and Benchmarks for Landsat Foundation Modelsβ16Updated 3 months ago
- β41Updated 9 months ago
- [Nature Machine Intelligence 2025] This repository is the official implementation of the paper "A semantic-enhanced multi-modal remote seβ¦β107Updated 3 weeks ago
- β137Updated 9 months ago
- Official repo for [NeurlPS 2025 Spotlight] "GeoLLaVA-8K: Scaling Remote-Sensing Multimodal Large Language Models to 8K Resolution"β25Updated 3 weeks ago
- RS-Agent: An LLM-driven Remote Sensing Intelligent Agentβ38Updated 4 months ago
- β55Updated last year
- β74Updated last year