VisionXLab / LRS-VQALinks
[ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning
☆39Updated 3 months ago
Alternatives and similar repositories for LRS-VQA
Users that are interested in LRS-VQA are comparing it to the libraries listed below
Sorting:
- Code and updates for the ScoreRS project.☆34Updated 2 months ago
- ☆35Updated 11 months ago
- ☆64Updated 6 months ago
- GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding☆73Updated 6 months ago
- [CVPR 2025 HIghlight] XLRS-Bench: ould Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery?☆48Updated 2 weeks ago
- ☆43Updated 10 months ago
- Offical implementation of "Visual Instruction Pretraining for Domain-Specific Foundation Models"☆95Updated last week
- Paper list for LLM/MLLM-based image segmentation☆37Updated last week
- ☆36Updated last year
- VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis☆106Updated 9 months ago
- Vision-Language Dataset for Remote Sensing☆39Updated 5 months ago
- A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understanding☆125Updated 3 months ago
- [TPAMI2024] Learning to Holistically Detect Bridges from Large-Size VHR Remote Sensing Imagery☆14Updated 8 months ago
- ☆19Updated 11 months ago
- ☆11Updated last year
- The first large-scale multimodal dialogue dataset focusing on Synthetic Aperture Radar (SAR) imagery.☆63Updated 9 months ago
- ☆88Updated 9 months ago
- Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation☆141Updated last year
- InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition (NeurIPS 2025)☆97Updated last month
- [IJCV] PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection☆39Updated last month
- ☆19Updated last year
- This is the implement of the paper "DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding"☆78Updated 5 months ago
- [TGRS 2024] Language-Guided Progressive Attention for Visual Grounding in Remote Sensing Images.☆46Updated 5 months ago
- ☆54Updated 2 weeks ago
- [IEEE GRSM 2025 🔥] "Text2Earth: Unlocking Text-driven Remote Sensing Image Generation with a Global-Scale Dataset and a Foundation Model…☆148Updated 2 months ago
- SARLANG-1M is a large-scale benchmark tailored for multimodal SAR image understanding, with a primary focus on integrating SAR with textu…☆36Updated 4 months ago
- SegEarth-R1: Geospatial Pixel Reasoning via Large Language Model☆126Updated 2 months ago
- ☆29Updated 3 months ago
- Official Code for “EarthSynth: Generating Informative Earth Observation with Diffusion Models”☆50Updated 2 weeks ago
- ☆57Updated 3 weeks ago