opendatalab / UrBench
[AAAI 2025] This repo contains evaluation code for the paper “UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios”
☆35Updated 6 months ago
Alternatives and similar repositories for UrBench
Users who are interested in UrBench are comparing it to the repositories listed below
- VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis☆103Updated 8 months ago
- ☆29Updated this week
- [GRSM] Project Page for "GeoPix: Multi-Modal Large Language Model for Pixel-level Image Understanding in Remote Sensing"☆54Updated 5 months ago
- A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understanding☆125Updated 2 months ago
- [ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning☆38Updated 2 months ago
- ☆33Updated 10 months ago
- ☆60Updated 5 months ago
- A Survey on Vision-Language Geo-Foundation Models (VLGFMs)☆174Updated 5 months ago
- ☆42Updated 9 months ago
- VGI-Enhanced multimodal large language model for remote sensing images.☆170Updated 7 months ago
- [CVPR 2025 Highlight] XLRS-Bench: Could Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery?☆48Updated 2 months ago
- SegEarth-R1: Geospatial Pixel Reasoning via Large Language Model☆122Updated 2 months ago
- ☆18Updated last month
- Paper list for LLM/MLLM-based image segmentation☆35Updated this week
- [ICCV 2025] The official implementation of the paper “Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm”☆74Updated last week
- [CVPR 2025 Oral] SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images☆196Updated 3 months ago
- ☆118Updated 4 months ago
- Code repository for paper: "G3: An Effective and Adaptive Framework for Worldwide Geolocalization Using Large Multi-Modality Models"☆41Updated last month
- Code and updates for the ScoreRS project.☆29Updated last month
- [ISPRS2025] SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Model☆100Updated last month
- Official repo for [NeurIPS 2025 Spotlight] "GeoLLaVA-8K: Scaling Remote-Sensing Multimodal Large Language Models to 8K Resolution"☆29Updated 3 weeks ago
- [ICLR2025] Text4Seg: Reimagining Image Segmentation as Text Generation☆147Updated last month
- [ICCV 2025] Where am I? Cross-View Geo-localization with Natural Language Descriptions.☆44Updated 3 weeks ago
- An official repo for ECCV 2024 Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching☆101Updated 9 months ago
- RS5M: a large-scale vision language dataset for remote sensing [TGRS]☆286Updated 7 months ago
- Official implementation of the paper "Cross-View Meets Diffusion: Aerial Image Synthesis with Geometry and Text Guidance" (WACV 2025)☆13Updated 7 months ago
- ☆86Updated 8 months ago
- [IEEE TGRS 2024 🔥] Change-Agent: Toward Interactive Comprehensive Remote Sensing Change Interpretation and Analysis☆153Updated 3 months ago
- RSVG: Exploring Data and Model for Visual Grounding on Remote Sensing Data, 2022☆156Updated 3 weeks ago
- Collection of Remote Sensing Vision-Language Models☆141Updated last year