SparrowZheyuan18 / Awesome-GeolocalizationLinks
A Paper List for Geo-localization Research
☆16Updated last year
Alternatives and similar repositories for Awesome-Geolocalization
Users that are interested in Awesome-Geolocalization are comparing it to the libraries listed below
Sorting:
- Official Github of "Geolocation with Real Human Gameplay Data: A Large-Scale Dataset and Human-Like Reasoning Framework"☆12Updated 4 months ago
- [ICML 2024] GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language Model☆63Updated this week
- [AAAI 2025]This repo contains evaluation code for the paper “UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in…☆35Updated 6 months ago
- Code repository for paper: "G3: An Effective and Adaptive Framework for Worldwide Geolocalization Using Large Multi-Modality Models"☆41Updated last month
- An offical repo for ECCV 2024 Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching☆101Updated 9 months ago
- 😎 A curated list of CVPR 2025 Oral paper. Total 96☆54Updated 2 months ago
- [ICCV 2025] The official implementation of the paper “Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm”☆74Updated 2 weeks ago
- [arXiv 2025] Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps☆68Updated last week
- ☆15Updated 7 months ago
- [ICCV25 Oral] Token Activation Map to Visually Explain Multimodal LLMs☆107Updated 2 months ago
- [ICLR2025] Text4Seg: Reimagining Image Segmentation as Text Generation☆147Updated last month
- [CVPR 2024🔥] Unleashing Unlabeled Data: A Paradigm for Cross-View Geo-Localization☆109Updated last year
- [ICLR 2025 Spotlight] The official implementation of the paper “LOKI:A Comprehensive Synthetic Data Detection Benchmark using Large Multi…☆167Updated 7 months ago
- ☆125Updated 7 months ago
- [ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'☆284Updated 6 months ago
- [ICCV 2025] Where am I? Cross-View Geo-localization with Natural Language Descriptions.☆44Updated 3 weeks ago
- [ICCV2025] Official repository of the paper "Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary…☆127Updated last week
- [NeurIPS'24] SpatialEval: a benchmark to evaluate spatial reasoning abilities of MLLMs and LLMs☆53Updated 9 months ago
- Survey: https://arxiv.org/pdf/2507.20198☆179Updated last week
- ☆18Updated 4 months ago
- [NeurIPS2024] Repo for the paper `ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'☆195Updated 3 months ago
- [NeurIPS 2025] Official repository for “FlowCut: Rethinking Redundancy via Information Flow for Efficient Vision-Language Models”☆24Updated last month
- Official implementation and datasets of AddressCLIP☆65Updated last year
- Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing☆73Updated 3 months ago
- ☆18Updated last month
- ☆29Updated this week
- The official implementation of "PixelThink: Towards Efficient Chain-of-Pixel Reasoning" (arXiv 2025)☆38Updated 5 months ago
- Visual Planning: Let's Think Only with Images☆280Updated 5 months ago
- This repo is the official implementation of "Euclid’s Gift: Enhancing Spatial Perception and Reasoning in Vision‑Language Models via Geom…☆20Updated 2 weeks ago
- Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation☆56Updated 5 months ago