VisionXLab / LRS-VQALinks
[ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning
☆26Updated 2 months ago
Alternatives and similar repositories for LRS-VQA
Users that are interested in LRS-VQA are comparing it to the libraries listed below
Sorting:
- GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding☆51Updated 2 months ago
- ☆31Updated 7 months ago
- Code and updates for the ScoreRS project.☆23Updated 4 months ago
- VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis☆92Updated 4 months ago
- Paper list for LLM/MLLM-based image segmentation☆22Updated last week
- ☆53Updated 2 months ago
- ☆13Updated 7 months ago
- [IJCV] PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection☆28Updated last month
- ☆41Updated 6 months ago
- ☆36Updated last year
- A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understanding☆108Updated 3 weeks ago
- The first large-scale multimodal dialogue dataset focusing on Synthetic Aperture Radar (SAR) imagery.☆55Updated 5 months ago
- ☆27Updated last month
- [ICLR 2025] Official Pytorch Implementation of MMR: A Large-scale Benchmark Dataset for Multi-target and Multi-granularity Reasoning Segm…☆15Updated 3 months ago
- InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition☆52Updated last month
- Vision-Language Dataset for Remote Sensing☆33Updated last month
- [CVPR 2025 HIghlight] XLRS-Bench: ould Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery?☆43Updated last month
- This is the official code for "Enhancing Perception of Key Changes in Remote Sensing Image Change Captioning"☆14Updated 9 months ago
- ☆51Updated last year
- Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation☆126Updated last year
- ☆20Updated 3 months ago
- This is the implement of the paper "DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding"☆64Updated last month
- [TPAMI2024] Learning to Holistically Detect Bridges from Large-Size VHR Remote Sensing Imagery☆12Updated 3 months ago
- [TGRS 2025] Code for "PointSAM: Pointly-Supervised Segment Anything Model for Remote Sensing Images"☆45Updated last month
- [IGARSS 2025 Oral] A Simple Aerial Detection Baseline of Multimodal Language Models.☆79Updated last week
- The official repo for [IJCAI'24] "LeMeViT: Efficient Vision Transformer with Learnable Meta Tokens for Remote Sensing Image Interpretatio…☆50Updated 8 months ago
- [ISPRS2025] SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Model☆88Updated 2 months ago
- SegEarth-R1: Geospatial Pixel Reasoning via Large Language Model☆100Updated 3 weeks ago
- [TGRS 2024] Language-Guided Progressive Attention for Visual Grounding in Remote Sensing Images.☆39Updated last month
- This is the pytorch implement of our paper "CCExpert: Advancing MLLM Capability in Remote Sensing Change Captioning with Difference-Aware…☆31Updated 7 months ago