SparrowZheyuan18 / Awesome-Geolocalization
A Paper List for Geo-localization Research
☆16Updated last year
Alternatives and similar repositories for Awesome-Geolocalization
Users who are interested in Awesome-Geolocalization are comparing it to the repositories listed below
- Official Github of "Geolocation with Real Human Gameplay Data: A Large-Scale Dataset and Human-Like Reasoning Framework"☆14Updated 3 weeks ago
- [ICML 2024] GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language Model☆66Updated 2 months ago
- An official repo for ECCV 2024 Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching☆107Updated 11 months ago
- [AAAI 2025] This repo contains evaluation code for the paper “UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in…☆35Updated 8 months ago
- Code repository for paper: "G3: An Effective and Adaptive Framework for Worldwide Geolocalization Using Large Multi-Modality Models"☆44Updated last week
- ☆15Updated 9 months ago
- [arXiv 2025] Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps☆70Updated last month
- [CVPR 2024🔥] Unleashing Unlabeled Data: A Paradigm for Cross-View Geo-Localization☆111Updated last year
- [ICCV25 Oral] Token Activation Map to Visually Explain Multimodal LLMs☆145Updated 2 weeks ago
- [ICLR2025] Text4Seg: Reimagining Image Segmentation as Text Generation☆156Updated last month
- 😎 A curated list of CVPR 2025 Oral papers. Total 96☆59Updated 3 weeks ago
- [ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'☆310Updated 8 months ago
- Visual Planning: Let's Think Only with Images☆289Updated 7 months ago
- [NeurIPS'24] SpatialEval: a benchmark to evaluate spatial reasoning abilities of MLLMs and LLMs☆57Updated 11 months ago
- ☆133Updated 9 months ago
- Official implementation and datasets of AddressCLIP☆67Updated last year
- The first attempt to replicate o3-like visual clue-tracking reasoning capabilities.☆62Updated 5 months ago
- Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing☆84Updated 5 months ago
- [ICCV 2025] The official implementation of the paper “Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm”☆82Updated 2 months ago
- Survey: https://arxiv.org/pdf/2507.20198☆257Updated last week
- [NeurIPS 2025] Official repository for “FlowCut: Rethinking Redundancy via Information Flow for Efficient Vision-Language Models”☆26Updated 3 weeks ago
- ☆36Updated 5 months ago
- SpaceR: The first MLLM empowered by SG-RLVR for video spatial reasoning☆100Updated 5 months ago
- MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence☆40Updated this week
- Code for Retrieval-Augmented Perception (ICML 2025)☆66Updated 4 months ago
- [ICCV 2025] Where am I? Cross-View Geo-localization with Natural Language Descriptions.☆55Updated 2 weeks ago
- The official repo for "Where do Large Vision-Language Models Look at when Answering Questions?"☆49Updated 7 months ago
- Official implementation of the paper "Cross-View Meets Diffusion: Aerial Image Synthesis with Geometry and Text Guidance" (WACV 2025)☆14Updated 9 months ago
- [NeurIPS2024] Repo for the paper 'ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'☆201Updated 5 months ago
- ViewSpatial-Bench: Evaluating Multi-perspective Spatial Localization in Vision-Language Models☆66Updated 7 months ago