SparrowZheyuan18 / Awesome-Geolocalization
A Paper List for Geo-localization Research
☆16Updated last year
Alternatives and similar repositories for Awesome-Geolocalization
Users that are interested in Awesome-Geolocalization are comparing it to the libraries listed below
- Official Github of "Geolocation with Real Human Gameplay Data: A Large-Scale Dataset and Human-Like Reasoning Framework"☆14Updated last week
- [ICML 2024] GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language Model☆65Updated last month
- [AAAI 2025] This repo contains evaluation code for the paper “UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in…☆35Updated 8 months ago
- [ICCV 2025] The official implementation of the paper “Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm”☆80Updated last month
- An official repo for the ECCV 2024 paper "Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching"☆107Updated 10 months ago
- Code repository for paper: "G3: An Effective and Adaptive Framework for Worldwide Geolocalization Using Large Multi-Modality Models"☆41Updated 2 months ago
- [CVPR 2024🔥] Unleashing Unlabeled Data: A Paradigm for Cross-View Geo-Localization☆110Updated last year
- [ICCV25 Oral] Token Activation Map to Visually Explain Multimodal LLMs☆143Updated 4 months ago
- Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing☆82Updated 4 months ago
- [ICCV 2025] Where am I? Cross-View Geo-localization with Natural Language Descriptions.☆52Updated this week
- [ICLR2025] Text4Seg: Reimagining Image Segmentation as Text Generation☆154Updated last month
- [arXiv 2025] Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps☆70Updated last month
- ☆20Updated 3 months ago
- Official implementation and datasets of AddressCLIP☆67Updated last year
- Survey: https://arxiv.org/pdf/2507.20198☆235Updated last month
- [NeurIPS 2025] Official repository for “FlowCut: Rethinking Redundancy via Information Flow for Efficient Vision-Language Models”☆25Updated this week
- Visual Planning: Let's Think Only with Images☆284Updated 6 months ago
- [ICCV 2025] The official PyTorch implementation of "LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMs".☆21Updated last month
- [ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'☆306Updated 7 months ago
- ☆132Updated 8 months ago
- ☆15Updated 8 months ago
- Official implementation of the paper "Cross-View Meets Diffusion: Aerial Image Synthesis with Geometry and Text Guidance" (WACV 2025)☆13Updated 9 months ago
- A first attempt to replicate o3-like visual clue-tracking reasoning capabilities.☆61Updated 5 months ago
- [ICCV 2025] LIRA☆21Updated 2 weeks ago
- The official repo for "Where do Large Vision-Language Models Look at when Answering Questions?"☆50Updated 6 months ago
- SpaceR: The first MLLM empowered by SG-RLVR for video spatial reasoning☆99Updated 5 months ago
- [ICLR 2025] See What You Are Told: Visual Attention Sink in Large Multimodal Models☆77Updated 9 months ago
- [CVPR 2025 Highlight] Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding☆51Updated 3 months ago
- [LLaVA-Video-R1]✨First Adaptation of R1 to LLaVA-Video (2025-03-18)☆35Updated 7 months ago
- [NeurIPS 2025] Official Repo of Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration☆99Updated last week