R1-Vision: Let's first take a look at the image
☆48Feb 16, 2025Updated last year
Alternatives and similar repositories for R1-Vision
Users that are interested in R1-Vision are comparing it to the libraries listed below
Sorting:
- R1-onevision, a visual language model capable of deep CoT reasoning.☆577Apr 13, 2025Updated 10 months ago
- [CVPR 2026] MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources☆216Sep 26, 2025Updated 5 months ago
- A fork to add multimodal model training to open-r1☆1,496Feb 8, 2025Updated last year
- Latest Advances on Reasoning of Multimodal Large Language Models (Multimodal R1 \ Visual R1) ) 🍓☆36Apr 3, 2025Updated 11 months ago
- MM-Eureka V0 also called R1-Multimodal-Journey, Latest version is in MM-Eureka☆324Jun 21, 2025Updated 8 months ago
- [ICLR2026] This is the first paper to explore how to effectively use R1-like RL for MLLMs and introduce Vision-R1, a reasoning MLLM that…☆776Jan 26, 2026Updated last month
- ✨First Open-Source R1-like Video-LLM [2025/02/18]☆382Feb 23, 2025Updated last year
- TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models☆37Nov 10, 2024Updated last year
- MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning☆771Sep 7, 2025Updated 6 months ago
- Witness the aha moment of VLM with less than $3.☆4,036May 19, 2025Updated 9 months ago
- Metis-RISE: RL Incentivizes and SFT Enhances Multimodal Reasoning Model Learning☆23Jun 26, 2025Updated 8 months ago
- This repository will continuously update the latest papers, technical reports, benchmarks about multimodal reasoning!☆55Mar 21, 2025Updated 11 months ago
- Code for ICLR 2025 Paper: Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs☆22May 7, 2025Updated 10 months ago
- NeurIPS文章+代码(最新2021更新)☆22Nov 24, 2021Updated 4 years ago
- This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-bas…☆1,365Feb 26, 2026Updated last week
- ☆58Feb 27, 2026Updated last week
- Video-R1: Reinforcing Video Reasoning in MLLMs [🔥the first paper to explore R1 for video]☆835Dec 14, 2025Updated 2 months ago
- ☆27Jul 20, 2024Updated last year
- Encourage Medical LLM to engage in deep thinking similar to DeepSeek-R1.☆26Apr 24, 2025Updated 10 months ago
- SFT+RL boosts multimodal reasoning☆46Jun 27, 2025Updated 8 months ago
- ☆111Sep 11, 2025Updated 5 months ago
- ☆25May 13, 2024Updated last year
- A high content analysis method to study skeletal muscle☆12Apr 26, 2024Updated last year
- SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward☆92Aug 8, 2025Updated 7 months ago
- ☆39Aug 4, 2025Updated 7 months ago
- [ICCV 2025] Dynamic-VLM☆28Dec 16, 2024Updated last year
- Official Repository: A Comprehensive Benchmark for Logical Reasoning in MLLMs☆45Jun 17, 2025Updated 8 months ago
- Latest open-source "Thinking with images" (O3/O4-mini) papers, covering training-free, SFT-based, and RL-enhanced methods for "fine-grain…☆111Aug 21, 2025Updated 6 months ago
- A RLHF Infrastructure for Vision-Language Models☆197Nov 15, 2024Updated last year
- A Comprehensive Survey on Evaluating Reasoning Capabilities in Multimodal Large Language Models.☆73Mar 18, 2025Updated 11 months ago
- [NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient☆65Sep 27, 2025Updated 5 months ago
- [ICLR 2025 Spotlight] OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text☆413May 5, 2025Updated 10 months ago
- This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"☆73Apr 22, 2025Updated 10 months ago
- Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'’☆2,308Oct 29, 2025Updated 4 months ago
- ☆1,137Nov 20, 2025Updated 3 months ago
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]☆183Jun 5, 2025Updated 9 months ago
- Official Code for All-in-One Medical Image Re-Identification (CVPR2025)☆18Jan 11, 2026Updated last month
- Latest Advances on (RL based) Multimodal Reasoning and Generation in Multimodal Large Language Models☆47Oct 30, 2025Updated 4 months ago
- ☆98Jun 23, 2025Updated 8 months ago