opendatalab / UrBenchLinks
[AAAI 2025]This repo contains evaluation code for the paper “UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios”
☆30Updated 2 months ago
Alternatives and similar repositories for UrBench
Users that are interested in UrBench are comparing it to the libraries listed below
Sorting:
- ☆51Updated last month
- The official implementation of the paper “Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm”☆51Updated 3 months ago
- VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis☆90Updated 4 months ago
- [CVPR 2024, Highlight] The official implementation of the paper "SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation…☆38Updated 6 months ago
- ☆31Updated 6 months ago
- A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understanding☆105Updated last week
- ☆12Updated last month
- A Survey on Vision-Language Geo-Foundation Models (VLGFMs)☆168Updated last month
- When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning☆25Updated 2 months ago
- ☆75Updated 4 months ago
- ☆19Updated 2 months ago
- Code & Dataset repository for the paper "Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation"☆58Updated 5 months ago
- Official implementation of the paper "Cross-View Meets Diffusion: Aerial Image Synthesis with Geometry and Text Guidance" (WACV 2025)☆10Updated 3 months ago
- ☆40Updated 5 months ago
- GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding☆51Updated last month
- [CVPR 2025 Oral] SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images☆138Updated 2 months ago
- [ECCV 2024] About The official implementation of the paper "Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network“.☆83Updated last month
- SegEarth-R1: Geospatial Pixel Reasoning via Large Language Model☆98Updated last week
- GeoPixel: A Pixel Grounding Large Multimodal Model for Remote Sensing is specifically developed for high-resolution remote sensing image …☆96Updated 3 weeks ago
- [ISPRS2025] SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Model☆87Updated last month
- [JPRS'25] Official repository of the paper Cross-View Geo-Localization with Panoramic Street-View and VHR Satellite Imagery in Decentrali…☆10Updated 2 weeks ago
- Code and updates for the ScoreRS project.☆22Updated 3 months ago
- [TGRS 2024] Language-Guided Progressive Attention for Visual Grounding in Remote Sensing Images.☆38Updated 2 weeks ago
- Code repository for paper: "G3: An Effective and Adaptive Framework for Worldwide Geolocalization Using Large Multi-Modality Models"☆30Updated 2 months ago
- This is the implement of the paper "DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding"☆64Updated 2 weeks ago
- [GRSM] Project Page for "GeoPix: Multi-Modal Large Language Model for Pixel-level Image Understanding in Remote Sensing"☆33Updated last month
- Collection of Remote Sensing Vision-Language Models☆137Updated last year
- ☆50Updated last year
- [CVPR 2025] This is a model aggregated with CLIP and SAM version of SkySense for remote sensing interpretation described in SkySense-O: T…☆51Updated this week
- [TGRS 2025] Code for "PointSAM: Pointly-Supervised Segment Anything Model for Remote Sensing Images"☆44Updated 2 weeks ago