This repository continuously collects the latest papers, technical reports, and benchmarks on multimodal reasoning.
☆57 · Mar 21, 2025 · Updated last year
Alternatives and similar repositories for Mind_with_eyes_Awesome_MLLMs_Reasoning
Users that are interested in Mind_with_eyes_Awesome_MLLMs_Reasoning are comparing it to the libraries listed below.
- The first unified, efficient, and extensible evaluation toolkit for evaluating image generation and editing models across multiple benchm… ☆39 · Mar 11, 2026 · Updated 2 weeks ago
- Official code repository for Med-TTT. ☆17 · Jun 30, 2025 · Updated 9 months ago
- Accepted LLM Papers in NeurIPS 2024 ☆37 · Oct 13, 2024 · Updated last year
- The code of “Improving Weak-to-Strong Generalization with Scalable Oversight and Ensemble Learning” ☆17 · Feb 26, 2024 · Updated 2 years ago
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning ☆156 · Dec 24, 2024 · Updated last year
- [CVPR '25] Interleaved-Modal Chain-of-Thought ☆108 · Dec 30, 2025 · Updated 3 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso… ☆131 · Mar 18, 2025 · Updated last year
- Some papers on Knowledge Graph Embedding (KGE) ☆13 · Aug 16, 2022 · Updated 3 years ago
- AN O1 REPLICATION FOR CODING ☆333 · Dec 11, 2024 · Updated last year
- R1-Vision: Let's first take a look at the image ☆48 · Feb 16, 2025 · Updated last year
- A library of visualization tools for the interpretability and hallucination analysis of large vision-language models (LVLMs). ☆41 · May 22, 2025 · Updated 10 months ago
- An exploration of LLM steering ☆25 · Jun 15, 2024 · Updated last year
- ☆12 · Jul 16, 2025 · Updated 8 months ago
- MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiency ☆137 · Aug 5, 2025 · Updated 7 months ago
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGI ☆262 · Oct 17, 2025 · Updated 5 months ago
- [NeurIPS 2025] More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models ☆75 · May 31, 2025 · Updated 9 months ago
- ☆14 · May 9, 2024 · Updated last year
- This repository is the official implementation of "Look-Back: Implicit Visual Re-focusing in MLLM Reasoning". ☆84 · Jul 10, 2025 · Updated 8 months ago
- 🔥Awesome Multimodal Large Language Models Paper List ☆154 · Mar 12, 2025 · Updated last year
- GUIEvalKit: Open-source Evaluation Toolkit for GUI Agents ☆19 · Feb 26, 2026 · Updated last month
- Skill-Inject: Measuring Agent Vulnerability to Skill File Attacks ☆36 · Feb 24, 2026 · Updated last month
- [ICLR 2025] "Noisy Test-Time Adaptation in Vision-Language Models" ☆16 · Feb 22, 2025 · Updated last year
- ☆19 · May 19, 2024 · Updated last year
- ☆14 · Oct 17, 2024 · Updated last year
- Code for ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding [ICML 2025] ☆47 · Jul 22, 2025 · Updated 8 months ago
- This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin… ☆13 · Jul 27, 2025 · Updated 8 months ago
- ☆16 · Oct 9, 2024 · Updated last year
- This repository has been redirected to https://kuaisar.github.io/. ☆11 · Oct 12, 2023 · Updated 2 years ago
- [ACL 2024] Implementation for Advancing Abductive Reasoning in Knowledge Graphs through Complex Logical Hypothesis Generation ☆15 · Oct 9, 2025 · Updated 5 months ago
- ☆19 · Dec 6, 2023 · Updated 2 years ago
- DeepMedic starter code in PyTorch ☆31 · Jan 9, 2018 · Updated 8 years ago
- Collections of Papers and Projects for Multimodal Reasoning. ☆108 · Apr 25, 2025 · Updated 11 months ago
- Official implementation of the paper "Pretraining A Large Language Model using Distributed GPUs: A Memory-Efficient Decentralized Paradigm" ☆21 · Mar 18, 2026 · Updated last week
- Disrupting Diffusion: Token-Level Attention Erasure Attack against Diffusion-based Customization (ACM MM 2024) ☆18 · Mar 31, 2025 · Updated 11 months ago
- [ICCV 2025 Highlight] The official repository for "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining" ☆196 · Mar 17, 2025 · Updated last year
- Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement" ☆618 · Jan 17, 2026 · Updated 2 months ago
- [NeurIPS 2025] The official PyTorch implementation of the "Vision Function Layer in MLLM". ☆28 · Dec 18, 2025 · Updated 3 months ago
- ☆16 · Sep 4, 2025 · Updated 6 months ago
- Greentown Smart Home Command Language Large Model (SmartHomeCLLM), trained on tens of thousands of smart home control commands. Smart home command large model… ☆18 · Mar 15, 2024 · Updated 2 years ago