AI9Stars / XLRS-BenchLinks
[CVPR 2025 HIghlight] XLRS-Bench: ould Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery?
☆51Updated 2 months ago
Alternatives and similar repositories for XLRS-Bench
Users that are interested in XLRS-Bench are comparing it to the libraries listed below
Sorting:
- [ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning☆46Updated 5 months ago
- Code and updates for the ScoreRS project.☆38Updated 4 months ago
- ☆44Updated last year
- ☆66Updated last month
- The first large-scale multimodal dialogue dataset focusing on Synthetic Aperture Radar (SAR) imagery.☆65Updated 11 months ago
- ☆36Updated last year
- A Scene Graph-Enhanced Remote Sensing Large Vision-Language Model☆133Updated this week
- ☆21Updated last year
- VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis☆108Updated 11 months ago
- Paper list for LLM/MLLM-based image segmentation☆46Updated 3 weeks ago
- Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation☆146Updated last year
- RSVG: Exploring Data and Model for Visual Grounding on Remote Sensing Data, 2022☆169Updated last month
- This is the pytorch implement of our paper "CCExpert: Advancing MLLM Capability in Remote Sensing Change Captioning with Difference-Aware…☆36Updated last year
- ☆92Updated last month
- SegEarth-R1: Geospatial Pixel Reasoning via Large Language Model☆136Updated 3 weeks ago
- [TGRS 2024] Language-Guided Progressive Attention for Visual Grounding in Remote Sensing Images.☆50Updated 7 months ago
- Vision-Language Dataset for Remote Sensing☆39Updated 7 months ago
- Awesome Referring Remote Sensing Image Segmentation☆27Updated 3 months ago
- [GRSM] Project Page for "GeoPix: Multi-Modal Large Language Model for Pixel-level Image Understanding in Remote Sensing"☆62Updated 8 months ago
- Official PyTorch Implementation of SARLANG-1M: A Benchmark for Vision-Language Modeling in SAR Image Understanding [IEEE TGRS 2026].☆40Updated this week
- ☆90Updated last month
- ☆61Updated 2 months ago
- ☆21Updated last year
- This is the implement of the paper "DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding"☆84Updated 7 months ago
- This is the implement of the paper "RSRefSeg 2: Decoupling Referring Remote Sensing Image Segmentation with Foundation Models"☆24Updated 5 months ago
- 📖A curated list of VLMs Paper with codes in RS, Advancements in Visual Language Models for Remote Sensing: Datasets, Capabilities, and E…☆14Updated last year
- ☆39Updated last year
- Official repo for [NeurlPS 2025 Spotlight] "GeoLLaVA-8K: Scaling Remote-Sensing Multimodal Large Language Models to 8K Resolution"☆42Updated 2 months ago
- [ISPRS2025] SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Model☆116Updated last month
- VGI-Enhanced multimodal large language model for remote sensing images.☆181Updated 10 months ago