om-ai-lab / RS5MLinks
RS5M: a large-scale vision language dataset for remote sensing [TGRS]
β262Updated 3 months ago
Alternatives and similar repositories for RS5M
Users that are interested in RS5M are comparing it to the libraries listed below
Sorting:
- Collection of Remote Sensing Vision-Language Modelsβ137Updated last year
- π°οΈ Official repository of paper "RemoteCLIP: A Vision Language Foundation Model for Remote Sensing" (IEEE TGRS)β409Updated 11 months ago
- Awesome-Remote-Sensing-Vision-Language-Modelsβ170Updated last year
- VGI-Enhanced multimodal large language model for remote sensing images.β157Updated 3 months ago
- Multimodal Large Language Models for Remote Sensing (RS-MLLMs): A Surveyβ264Updated this week
- A Survey on Vision-Language Geo-Foundation Models (VLGFMs)β168Updated last month
- A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understandingβ105Updated last week
- RSVG: Exploring Data and Model for Visual Grounding on Remote Sensing Data, 2022β139Updated last year
- β106Updated last month
- Official repo for "SkyScript: A Large and Semantically Diverse Vision-Language Dataset for Remote Sensing"β174Updated 6 months ago
- A list of awesome remote sensing image captioning resourcesβ110Updated last week
- [IEEE TGRS 2024 π₯] Change-Agent: Toward Interactive Comprehensive Remote Sensing Change Interpretation and Analysisβ129Updated 2 months ago
- [AAAI 2024] EarthVQA: Towards Queryable Earth via Relational Reasoning-Based Remote Sensing Visual Question Answeringβ122Updated 3 months ago
- Official code for TEOChat, the first vision-language assistant for temporal earth observation data (ICLR 2025).β110Updated 3 weeks ago
- Official repository for "Rethinking Transformers Pre-training for Multi-Spectral Satellite Imagery" (CVPR 2024)β108Updated 2 months ago
- [ISPRS2025] SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Modelβ87Updated last month
- β121Updated 6 months ago
- β120Updated 5 months ago
- [IEEE TGRS 2022 π₯] Remote Sensing Image Change Captioning With Dual-Branch Transformers: A New Method and a Large Scale Datasetβ124Updated last year
- β50Updated last year
- β51Updated last month
- Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentationβ123Updated last year
- [CVPR 2025 Oral] SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Imagesβ138Updated 2 months ago
- π₯Remote Sensing Spatio-Temporal Vision-Language Models: A Comprehensive Surveyβ131Updated last week
- Changes to Captions: An Attentive Network for Remote Sensing Change Captioningβ71Updated last year
- The official repo for [JSTARS'24] "MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining"β231Updated 5 months ago
- Datasets for remote sensing images (Paper:Exploring Models and Data for Remote Sensing Image Caption Generation)β202Updated 3 years ago
- [IEEE GRSL 2024 π₯] RSCaMa: Remote Sensing Image Change Captioning with State Space Modelβ68Updated 7 months ago
- SegEarth-R1: Geospatial Pixel Reasoning via Large Language Modelβ97Updated last week
- [GRSM] Project Page for "GeoPix: Multi-Modal Large Language Model for Pixel-level Image Understanding in Remote Sensing"β33Updated last month