xinyan-cxy / MINT-CoTView external linksLinks
[NeurIPS 2025] MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning
☆101Sep 19, 2025Updated 4 months ago
Alternatives and similar repositories for MINT-CoT
Users that are interested in MINT-CoT are comparing it to the libraries listed below
Sorting:
- ☆15Mar 18, 2025Updated 10 months ago
- ☆61Dec 5, 2025Updated 2 months ago
- Official repository for "TrustGeoGen: Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving"☆23Sep 1, 2025Updated 5 months ago
- ☆64Feb 1, 2026Updated 2 weeks ago
- ☆18May 14, 2024Updated last year
- Are Video Models Ready as Zero-shot Reasoners?☆84Nov 24, 2025Updated 2 months ago
- [ICLR 2025] Mathematical Visual Instruction Tuning for Multi-modal Large Language Models☆152Dec 5, 2024Updated last year
- MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiency☆136Aug 5, 2025Updated 6 months ago
- 同济大学计算机系课程《编译原理》大作业项目。包含词法分析器,LR1语法分析器。☆14Jun 25, 2023Updated 2 years ago
- [ECCV 2024] Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?☆176Apr 28, 2025Updated 9 months ago
- This is the Repository for Geometry Problem Solving Method Evaluation☆26Oct 8, 2024Updated last year
- Official code repository of Shuffle-R1☆35Jan 27, 2026Updated 2 weeks ago
- Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual in…☆1,329Feb 3, 2026Updated last week
- A real-time swarf detection and analysis system based on YOLO and Qwen-vl-max, providing efficient video stream processing and intelligen…☆40Aug 5, 2025Updated 6 months ago
- [NeurIPS 2025] Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing☆90Jul 27, 2025Updated 6 months ago
- Modality Gap–Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models☆48Updated this week
- [EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs☆29May 22, 2025Updated 8 months ago
- Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning☆24Sep 9, 2024Updated last year
- Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"☆401Jan 29, 2026Updated 2 weeks ago
- ☆30Feb 6, 2026Updated last week
- Curated list of recent visual autoregressive (VAR) modeling works☆30Mar 17, 2025Updated 10 months ago
- The first Interleaved framework for textual reasoning within the visual generation process☆157Nov 21, 2025Updated 2 months ago
- [CVPR2025] BOLT: Boost Large Vision-Language Model Without Training for Long-form Video Understanding☆38Feb 5, 2026Updated last week
- Official repository of "Chatting Makes Perfect: Chat-based Image Retrieval"☆31Feb 5, 2025Updated last year
- This repository shares undergraduate course materials for the Electronic Information Engineering program at the University of Science and…☆63Oct 23, 2025Updated 3 months ago
- Code for our Paper "All in an Aggregated Image for In-Image Learning"☆29Apr 9, 2024Updated last year
- [CVPR 2025] The First Investigation of CoT Reasoning (RL, TTS, Reflection) in Image Generation☆855May 23, 2025Updated 8 months ago
- Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens (arXiv 2025)☆240Aug 2, 2025Updated 6 months ago
- [CVPR 2025] Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training☆102Jul 18, 2025Updated 6 months ago
- ☆123Oct 3, 2025Updated 4 months ago
- A fork to add multimodal model training to open-r1☆1,474Feb 8, 2025Updated last year
- [CVPR' 25] Interleaved-Modal Chain-of-Thought☆106Dec 30, 2025Updated last month
- 同济大学数字逻辑大作业☆34Jan 18, 2022Updated 4 years ago
- Visual Planning: Let's Think Only with Images☆299May 20, 2025Updated 8 months ago
- ☆65Jan 7, 2026Updated last month
- MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning [NeurIPS 2025 Poster]☆22Dec 10, 2025Updated 2 months ago
- Comparative Study and Implementation of Five Factor Model and Myers-Briggs Type Indicator Model☆11Sep 28, 2023Updated 2 years ago
- The first OpenSource Mafia Bot!☆10Oct 5, 2023Updated 2 years ago
- Multiprocessing in python☆10Aug 20, 2021Updated 4 years ago