SparrowZheyuan18 / Awesome-GeolocalizationLinks
A Paper List for Geo-localization Research
☆16Updated last year
Alternatives and similar repositories for Awesome-Geolocalization
Users that are interested in Awesome-Geolocalization are comparing it to the libraries listed below
Sorting:
- Official Github of "Geolocation with Real Human Gameplay Data: A Large-Scale Dataset and Human-Like Reasoning Framework"☆14Updated 3 weeks ago
- [ICML 2024] GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language Model☆66Updated 2 months ago
- [ICCV 2025] The official implementation of the paper “Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm”☆82Updated 3 months ago
- [AAAI 2025]This repo contains evaluation code for the paper “UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in…☆36Updated 9 months ago
- An offical repo for ECCV 2024 Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching☆109Updated 11 months ago
- [arXiv 2025] Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps☆71Updated last week
- Code repository for paper: "G3: An Effective and Adaptive Framework for Worldwide Geolocalization Using Large Multi-Modality Models"☆44Updated last month
- ☆15Updated 10 months ago
- Official implementation and datasets of AddressCLIP☆67Updated last year
- [CVPR 2024🔥] Unleashing Unlabeled Data: A Paradigm for Cross-View Geo-Localization☆111Updated last year
- [NeurIPS 2025] Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing☆89Updated 5 months ago
- [ICCV 2025] Where am I? Cross-View Geo-localization with Natural Language Descriptions.☆55Updated last month
- Official implementation of the paper "Cross-View Meets Diffusion: Aerial Image Synthesis with Geometry and Text Guidance" (WACV 2025)☆14Updated 10 months ago
- [ICLR 2025 Spotlight] The official implementation of the paper “LOKI:A Comprehensive Synthetic Data Detection Benchmark using Large Multi…☆174Updated 9 months ago
- [ICCV25 Oral] Token Activation Map to Visually Explain Multimodal LLMs☆158Updated last month
- 😎 A curated list of CVPR 2025 Oral paper. Total 96☆59Updated last month
- [ICLR2025] Text4Seg: Reimagining Image Segmentation as Text Generation☆161Updated 2 months ago
- ☆132Updated 10 months ago
- [ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'☆327Updated 9 months ago
- [NeurIPS 2025] Official repository for “FlowCut: Rethinking Redundancy via Information Flow for Efficient Vision-Language Models”☆28Updated last month
- Visual Planning: Let's Think Only with Images☆294Updated 8 months ago
- The official implementation of "PixelThink: Towards Efficient Chain-of-Pixel Reasoning" (arXiv 2025)☆39Updated 7 months ago
- Official release of "Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning"☆107Updated 3 weeks ago
- SpaceR: The first MLLM empowered by SG-RLVR for video spatial reasoning☆103Updated 6 months ago
- The first attempt to replicate o3-like visual clue-tracking reasoning capabilities.☆64Updated 6 months ago
- The official implementation of "Segment Anything with Multiple Modalities".☆109Updated last year
- [NeurIPS2024] Repo for the paper `ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'☆204Updated 6 months ago
- [ICCV 2025] The official pytorch implement of "LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMs".☆21Updated 2 months ago
- [CVPR'25] 🌟🌟 EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering☆44Updated 7 months ago
- Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence☆422Updated last week