TheEighthDay / SeekWorld
The first attempt to replicate o3-like visual clue-tracking reasoning capabilities.
☆64Jul 8, 2025Updated 7 months ago
Alternatives and similar repositories for SeekWorld
Users who are interested in SeekWorld are comparing it to the libraries listed below
- [ICML 2024] GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language Model☆68Feb 1, 2026Updated 2 weeks ago
- Official implementation of the RSE paper mKGR.☆20Jan 15, 2026Updated last month
- Official implementation of the ICCV 2025 paper HoliTracer.☆40Jan 13, 2026Updated last month
- [ISPRS P&RS'25] Official repository of the paper Cross-View Geo-Localization with Panoramic Street-View and VHR Satellite Imagery in Dece…☆19Nov 10, 2025Updated 3 months ago
- Research works from Tencent AI Lab regarding self-evolving agents☆82Jan 30, 2026Updated 2 weeks ago
- ☆23Apr 19, 2024Updated last year
- [arXiv 2025] Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps☆72Feb 6, 2026Updated last week
- ☆132Mar 22, 2025Updated 10 months ago
- Universal Video Temporal Grounding with Generative Multi-modal Large Language Models☆46Nov 25, 2025Updated 2 months ago
- [CVPR2025] SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories☆91Aug 8, 2025Updated 6 months ago
- GroundCUA☆67Dec 24, 2025Updated last month
- ☆31Feb 8, 2023Updated 3 years ago
- Code and updates for the ScoreRS project.☆40Sep 19, 2025Updated 4 months ago
- 📚 A collection of resources and papers on Large Language Models in autonomous driving☆27Oct 30, 2023Updated 2 years ago
- Official implementation and datasets of AddressCLIP☆66Jul 4, 2024Updated last year
- Chinese to emoji☆31May 12, 2022Updated 3 years ago
- ☆124Nov 1, 2025Updated 3 months ago
- Code for ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding [ICML 2025]☆45Jul 22, 2025Updated 6 months ago
- Visual Spatial Tuning☆173Feb 1, 2026Updated 2 weeks ago
- NeurIPS 2025: Discriminative Constrained Optimization for Reinforcing Large Reasoning Models☆50Feb 3, 2026Updated last week
- A collection of papers related to Geo-spatial Information Science in CVPR 2025.☆38Apr 1, 2025Updated 10 months ago
- Simplifies data migration between Apache Ignite clusters by relying on Apache Avro as an intermediate storage format☆13Jun 27, 2023Updated 2 years ago
- ☆10May 19, 2025Updated 8 months ago
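- A data development platform contributed by APEX and built on big-data platform capabilities. It helps enterprises connect data at minimal cost, build and consolidate data warehouse models, lower the barrier to data applications, and accumulate data value.☆12Oct 31, 2024Updated last year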
- [WACV 2025] 🌍🚗 SpaGBOL: Spatial-Graph-Based Orientated Localisation 📡🗺️☆14Apr 9, 2025Updated 10 months ago
- [ICLR 2026] The official implementation of the paper “Earth-Agent: Unlocking the Full Landscape of Earth Observation with Agents”☆95Feb 1, 2026Updated 2 weeks ago
- ☆33Jan 18, 2023Updated 3 years ago
- Close, But Not There: Boosting Geographic Distance Sensitivity in Visual Place Recognition☆41Dec 5, 2024Updated last year
- [ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning☆47Updated this week
- KuaiSearch PERKS☆12Nov 16, 2021Updated 4 years ago
- build vgg16 with pytorch 0.4.0 for classification of CIFAR datasets☆10Mar 31, 2019Updated 6 years ago
- Collaborative Discourse Manager☆11Nov 6, 2016Updated 9 years ago
- A simple exam generator and grader written in Python with OpenCV☆14Jan 14, 2026Updated last month
- Source code and files for the book "A Guide to Writing Android Apps with Android Studio"☆10Oct 4, 2015Updated 10 years ago
- Building a multi-agent RAG system with advanced RAG methods☆12Jan 12, 2025Updated last year
- Azure Machine Learning - MLOps Python SDKv2☆10Jul 24, 2023Updated 2 years ago
- [ECCV 2024] FlexAttention for Efficient High-Resolution Vision-Language Models☆46Jan 8, 2025Updated last year
- ☆181May 6, 2024Updated last year
- first attempt at description2code from 2016☆10Nov 15, 2018Updated 7 years ago