opendatalab / UrBenchLinks
[AAAI 2025]This repo contains evaluation code for the paper “UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios”
☆33Updated 3 months ago
Alternatives and similar repositories for UrBench
Users that are interested in UrBench are comparing it to the libraries listed below
Sorting:
- VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis☆92Updated 5 months ago
- ☆55Updated 2 months ago
- A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understanding☆115Updated this week
- A Survey on Vision-Language Geo-Foundation Models (VLGFMs)☆170Updated 2 months ago
- ☆41Updated 7 months ago
- [CVPR 2025 Oral] SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images☆165Updated 3 weeks ago
- [ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning☆32Updated last week
- ☆31Updated 7 months ago
- VGI-Enhanced multimodal large language model for remote sensing images.☆162Updated 5 months ago
- [GRSM] Project Page for "GeoPix: Multi-Modal Large Language Model for Pixel-level Image Understanding in Remote Sensing"☆41Updated 2 months ago
- ☆15Updated 2 months ago
- SegEarth-R1: Geospatial Pixel Reasoning via Large Language Model☆104Updated last month
- 🔥🔥First Multi-granularity, multi-sensor, multi-scale LMM for Earth Observation.☆75Updated 2 months ago
- [IEEE GRSM 2025 🔥] "Text2Earth: Unlocking Text-driven Remote Sensing Image Generation with a Global-Scale Dataset and a Foundation Model…☆121Updated 2 months ago
- Code & Dataset repository for the paper "Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation"☆59Updated 7 months ago
- [ICCV 2025] The official implementation of the paper “Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm”☆66Updated 3 weeks ago
- ☆111Updated 2 months ago
- This repository is the official implementation of the paper "SkySense++: A Semantic-Enhanced Multi-Modal Remote Sensing Foundation Model …☆22Updated 2 months ago
- [CVPR 2025 HIghlight] XLRS-Bench: ould Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery?☆43Updated 2 months ago
- The first large-scale multimodal dialogue dataset focusing on Synthetic Aperture Radar (SAR) imagery.☆58Updated 5 months ago
- ☆79Updated 5 months ago
- Paper list for LLM/MLLM-based image segmentation☆25Updated 3 weeks ago
- ☆52Updated last year
- [ISPRS2025] SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Model☆89Updated 2 months ago
- [ACMMM-25] Official repo of "RemoteSAM: Towards Segment Anything for Earth Observation"☆132Updated last month
- RS5M: a large-scale vision language dataset for remote sensing [TGRS]☆270Updated 4 months ago
- GeoPixel: A Pixel Grounding Large Multimodal Model for Remote Sensing is specifically developed for high-resolution remote sensing image …☆104Updated 2 months ago
- Official implementation of the paper "Cross-View Meets Diffusion: Aerial Image Synthesis with Geometry and Text Guidance" (WACV 2025)☆11Updated 5 months ago
- Official repo for "Foundation Models for Remote Sensing and Earth Observation: A Survey"☆44Updated 8 months ago
- [AAAI 2024] EarthVQA: Towards Queryable Earth via Relational Reasoning-Based Remote Sensing Visual Question Answering☆130Updated 5 months ago