VisionXLab / LRS-VQA
[ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning
☆34Updated last month
Alternatives and similar repositories for LRS-VQA
Users that are interested in LRS-VQA are comparing it to the libraries listed below
- Code and updates for the ScoreRS project.☆27Updated 6 months ago
- ☆32Updated 9 months ago
- [CVPR 2025 Highlight] XLRS-Bench: Could Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery?☆45Updated last month
- ☆36Updated last year
- Paper list for LLM/MLLM-based image segmentation☆30Updated 2 weeks ago
- ☆57Updated 4 months ago
- GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding☆67Updated 4 months ago
- VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis☆98Updated 6 months ago
- ☆41Updated 8 months ago
- Vision-Language Dataset for Remote Sensing☆35Updated 3 months ago
- ☆11Updated last year
- A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understanding☆120Updated last month
- InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition☆65Updated 3 months ago
- [TPAMI2024] Learning to Holistically Detect Bridges from Large-Size VHR Remote Sensing Imagery☆13Updated 6 months ago
- ☆15Updated 9 months ago
- The first large-scale multimodal dialogue dataset focusing on Synthetic Aperture Radar (SAR) imagery.☆59Updated 7 months ago
- ☆83Updated 7 months ago
- SegEarth-R1: Geospatial Pixel Reasoning via Large Language Model☆111Updated 3 weeks ago
- [IJCV] PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection☆35Updated 2 months ago
- [TGRS 2024] Language-Guided Progressive Attention for Visual Grounding in Remote Sensing Images.☆40Updated 3 months ago
- ☆24Updated last month
- This is the implementation of the paper "DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding"☆70Updated 3 months ago
- Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation☆133Updated last year
- SARLANG-1M is a large-scale benchmark tailored for multimodal SAR image understanding, with a primary focus on integrating SAR with textu…☆31Updated 2 months ago
- ☆18Updated last year
- Code & Dataset repository for the paper "Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation"☆62Updated 8 months ago
- This is the PyTorch implementation of our paper "CCExpert: Advancing MLLM Capability in Remote Sensing Change Captioning with Difference-Aware…☆33Updated 9 months ago
- This is the PyTorch implementation of the paper "RSRefSeg: Referring Remote Sensing Image Segmentation with Foundation Models"☆62Updated last month
- The official repo for [IJCAI'24] "LeMeViT: Efficient Vision Transformer with Learnable Meta Tokens for Remote Sensing Image Interpretatio…☆53Updated 10 months ago
- EagleVision: Object-level Attribute Multimodal LLM for Remote Sensing☆16Updated 3 months ago