om-ai-lab / ZoomEyeView external linksLinks
[EMNLP-2025 Oral] ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration
☆72Nov 20, 2025Updated 2 months ago
Alternatives and similar repositories for ZoomEye
Users that are interested in ZoomEye are comparing it to the libraries listed below
Sorting:
- A suite of multimodal language models that are powerful and efficient☆17Jan 13, 2025Updated last year
- A collection of strong multimodal models for building multimodal AGI agents☆44Jul 9, 2024Updated last year
- Reproducible Language Agent Research☆33Jun 25, 2025Updated 7 months ago
- Geo-OLMs Repo: Accepted to ACM COMPASS 2025☆19Jun 17, 2025Updated 7 months ago
- A Comprehensive Evaluation Benchmark for Open-Vocabulary Detection (AAAI 2024)☆62May 7, 2024Updated last year
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆19Mar 10, 2025Updated 11 months ago
- PyTorch Implementation of "Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Larg…☆39Dec 5, 2025Updated 2 months ago
- iLLaVA: An Image is Worth Fewer Than 1/3 Input Tokens in Large Multimodal Models☆21Jan 29, 2025Updated last year
- Codes for Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models☆277Aug 5, 2025Updated 6 months ago
- ✨✨ [ICLR 2026] MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models☆43Apr 10, 2025Updated 10 months ago
- Implementation and dataset for paper "Can MLLMs Perform Text-to-Image In-Context Learning?"☆42Jun 2, 2025Updated 8 months ago
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity☆22Aug 28, 2025Updated 5 months ago
- ☆14Mar 20, 2025Updated 10 months ago
- ☆17Nov 28, 2025Updated 2 months ago
- Official code for NeurIPS 2025 paper "GRIT: Teaching MLLMs to Think with Images"☆173Jan 16, 2026Updated 3 weeks ago
- Enhancing Ultrahigh Resolution Remote Sensing Imagery Analysis With ImageRAG [GRSM]☆28Feb 4, 2026Updated last week
- [arXiv, 2024] Show Me What and Where has Changed? Question Answering and Grounding for Remote Sensing Change Detection☆34Jul 2, 2025Updated 7 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆50Feb 4, 2026Updated last week
- [CVPR' 25] Interleaved-Modal Chain-of-Thought☆106Dec 30, 2025Updated last month
- MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs☆35Jan 18, 2026Updated 3 weeks ago
- Elastic Workplace Search Official Python Client☆10Aug 8, 2024Updated last year
- An up-to-date & curated list of awesome layout to image papers, methods & resources.☆13Jun 28, 2024Updated last year
- Under construction☆13Jan 15, 2025Updated last year
- ☆12Dec 4, 2024Updated last year
- Deep Learning for Video Retrieval by Natural Language☆11Oct 20, 2019Updated 6 years ago
- CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts☆162Jun 8, 2024Updated last year
- Official PyTorch implementation of “MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image Segmentation”☆18Dec 5, 2024Updated last year
- ☆21Jul 21, 2025Updated 6 months ago
- ☆12Aug 21, 2024Updated last year
- More reliable Video Understanding Evaluation☆14Sep 23, 2025Updated 4 months ago
- ☆19Jun 4, 2025Updated 8 months ago
- ☆54Jan 17, 2025Updated last year
- [Technical Report] Official PyTorch implementation code for realizing the technical part of Phantom of Latent representing equipped with …☆63Oct 9, 2024Updated last year
- ☆32Mar 7, 2022Updated 3 years ago
- Repository for the paper: Teaching VLMs to Localize Specific Objects from In-context Examples☆40Nov 27, 2024Updated last year
- Codes for ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding [ICML 2025]]☆45Jul 22, 2025Updated 6 months ago
- ☆16Mar 26, 2025Updated 10 months ago
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders☆18May 23, 2025Updated 8 months ago
- Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes☆21Jun 15, 2025Updated 7 months ago