opendatalab / UrBenchLinks
[AAAI 2025]This repo contains evaluation code for the paper “UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios”
☆31Updated 3 months ago
Alternatives and similar repositories for UrBench
Users that are interested in UrBench are comparing it to the libraries listed below
Sorting:
- VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis☆92Updated 4 months ago
- ☆53Updated 2 months ago
- A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understanding☆108Updated 3 weeks ago
- ☆31Updated 7 months ago
- [ICCV 2025] The official implementation of the paper “Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm”☆56Updated 2 weeks ago
- ☆14Updated 2 months ago
- A Survey on Vision-Language Geo-Foundation Models (VLGFMs)☆168Updated last month
- [ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning☆26Updated 2 months ago
- ☆41Updated 6 months ago
- [ISPRS2025] SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Model☆88Updated 2 months ago
- [CVPR 2025 Oral] SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images☆149Updated last week
- [ICCV 2025] Where am I? Cross-View Geo-localization with Natural Language Descriptions.☆26Updated last week
- GeoPixel: A Pixel Grounding Large Multimodal Model for Remote Sensing is specifically developed for high-resolution remote sensing image …☆100Updated last month
- [CVPR 2024, Highlight] The official implementation of the paper "SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation…☆40Updated 7 months ago
- Code repository for paper: "G3: An Effective and Adaptive Framework for Worldwide Geolocalization Using Large Multi-Modality Models"☆32Updated 2 months ago
- GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding☆51Updated 2 months ago
- InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition☆52Updated last month
- VGI-Enhanced multimodal large language model for remote sensing images.☆161Updated 4 months ago
- SegEarth-R1: Geospatial Pixel Reasoning via Large Language Model☆100Updated 3 weeks ago
- ☆75Updated 5 months ago
- This repository is the official implementation of the paper "SkySense++: A Semantic-Enhanced Multi-Modal Remote Sensing Foundation Model …☆21Updated 2 months ago
- ☆122Updated 6 months ago
- Collection of Remote Sensing Vision-Language Models☆138Updated last year
- ☆110Updated last month
- An offical repo for ECCV 2024 Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching☆93Updated 5 months ago
- The first large-scale multimodal dialogue dataset focusing on Synthetic Aperture Radar (SAR) imagery.☆55Updated 5 months ago
- 🔥🔥First Multi-granularity, multi-sensor, multi-scale LMM for Earth Observation.☆71Updated last month
- ☆51Updated last year
- [IEEE GRSM 2025 🔥] "Text2Earth: Unlocking Text-driven Remote Sensing Image Generation with a Global-Scale Dataset and a Foundation Model…☆115Updated last month
- [IEEE TGRS 2024 🔥] Change-Agent: Toward Interactive Comprehensive Remote Sensing Change Interpretation and Analysis☆132Updated 3 months ago