zhaochen0110 / OpenThinkIMG
OpenThinkIMG is an end-to-end open-source framework that empowers LVLMs to think with images.
☆315 Updated 4 months ago
Alternatives and similar repositories for OpenThinkIMG
Users who are interested in OpenThinkIMG are comparing it to the repositories listed below.
- Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search" ☆342 Updated last month
- MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources ☆203 Updated last month
- MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search too… ☆335 Updated 2 months ago
- [CVPR2025 Highlight] Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models ☆224 Updated 3 months ago
- 🚀ReVisual-R1 is a 7B open-source multimodal language model that follows a three-stage curriculum—cold-start pre-training, multimodal rei… ☆185 Updated last week
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]