ZiruiSongBest / Geocomp
Official GitHub repository of "Geolocation with Real Human Gameplay Data: A Large-Scale Dataset and Human-Like Reasoning Framework"
☆14Updated 3 weeks ago
Alternatives and similar repositories for Geocomp
Users that are interested in Geocomp are comparing it to the libraries listed below
- A Paper List for Geo-localization Research☆16Updated last year
- [AAAI 2025] This repo contains evaluation code for the paper “UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in…☆36Updated 9 months ago
- Code repository for paper: "G3: An Effective and Adaptive Framework for Worldwide Geolocalization Using Large Multi-Modality Models"☆44Updated last month
- ☆15Updated 10 months ago
- [ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning☆46Updated 5 months ago
- VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis☆109Updated 11 months ago
- ☆39Updated last year
- 🔥🔥First Multi-granularity, multi-sensor, multi-scale LMM for Earth Observation.☆91Updated 3 months ago
- The first attempt to replicate o3-like visual clue-tracking reasoning capabilities.☆64Updated 6 months ago
- [ICML 2024] GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language Model☆66Updated 3 months ago
- ☆66Updated last month
- The official implementation of "PixelThink: Towards Efficient Chain-of-Pixel Reasoning" (arXiv 2025)☆39Updated 8 months ago
- [CVPR 2025 Highlight] XLRS-Bench: Could Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery?☆52Updated 2 months ago
- Official implementation of the ICCV 2025 paper HoliTracer.☆37Updated 2 weeks ago
- Code and updates for the ScoreRS project.☆38Updated 4 months ago
- [ICCV 2025] UrbanLLaVA: A Multi-modal Large Language Model for Urban Intelligence with Spatial Reasoning and Understanding.☆67Updated 3 months ago
- ☆92Updated 2 months ago
- [ICCV 2025] The official implementation of the paper “Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm”☆82Updated 3 months ago
- [NeurIPS 2025] Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing☆90Updated 6 months ago
- [ICLR2025] Text4Seg: Reimagining Image Segmentation as Text Generation☆162Updated 2 months ago
- ☆26Updated 4 months ago
- [arXiv 2025] Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps☆71Updated 2 weeks ago
- Official release of "Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning"☆108Updated last month
- [CVPR 2024🔥] Unleashing Unlabeled Data: A Paradigm for Cross-View Geo-Localization☆112Updated last year
- Better, Stronger, Faster: Tackling the Trilemma in MLLM-based Segmentation with Simultaneous Textual Mask Prediction☆31Updated 3 weeks ago
- Pixel-Level Reasoning Model trained with RL [NeurIPS25]☆267Updated 2 months ago
- ☆12Updated last year
- InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition (NeurIPS 2025)☆107Updated this week
- Official code repository of Shuffle-R1☆25Updated 5 months ago
- A Scene Graph-Enhanced Remote Sensing Large Vision-Language Model☆134Updated last week