yuyq96 / R1-VisionView external linksLinks
R1-Vision: Let's first take a look at the image
☆48Feb 16, 2025Updated 11 months ago
Alternatives and similar repositories for R1-Vision
Users that are interested in R1-Vision are comparing it to the libraries listed below
Sorting:
- MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources☆215Sep 26, 2025Updated 4 months ago
- A fork to add multimodal model training to open-r1☆1,449Feb 8, 2025Updated last year
- Latest Advances on Reasoning of Multimodal Large Language Models (Multimodal R1 \ Visual R1) ) 🍓☆36Apr 3, 2025Updated 10 months ago
- MM-Eureka V0 also called R1-Multimodal-Journey, Latest version is in MM-Eureka☆322Jun 21, 2025Updated 7 months ago
- Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.☆840May 14, 2025Updated 9 months ago
- [ICLR2026] This is the first paper to explore how to effectively use R1-like RL for MLLMs and introduce Vision-R1, a reasoning MLLM that…☆760Jan 26, 2026Updated 2 weeks ago
- ✨First Open-Source R1-like Video-LLM [2025/02/18]☆381Feb 23, 2025Updated 11 months ago
- TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models☆37Nov 10, 2024Updated last year
- MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning☆768Sep 7, 2025Updated 5 months ago
- Witness the aha moment of VLM with less than $3.☆4,029May 19, 2025Updated 8 months ago
- Metis-RISE: RL Incentivizes and SFT Enhances Multimodal Reasoning Model Learning☆23Jun 26, 2025Updated 7 months ago
- Explore the Multimodal “Aha Moment” on 2B Model☆623Mar 18, 2025Updated 10 months ago
- This repository will continuously update the latest papers, technical reports, benchmarks about multimodal reasoning!☆54Mar 21, 2025Updated 10 months ago
- Code for ICLR 2025 Paper: Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs☆22May 7, 2025Updated 9 months ago
- NeurIPS文章+代码(最新2021更新)☆22Nov 24, 2021Updated 4 years ago
- This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-bas…☆1,349Dec 7, 2025Updated 2 months ago
- Solve Visual Understanding with Reinforced VLMs☆5,833Oct 21, 2025Updated 3 months ago
- An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models.☆155Jan 5, 2026Updated last month
- Video-R1: Reinforcing Video Reasoning in MLLMs [🔥the first paper to explore R1 for video]☆816Dec 14, 2025Updated 2 months ago
- ☆27Jul 20, 2024Updated last year
- Encourage Medical LLM to engage in deep thinking similar to DeepSeek-R1.☆26Apr 24, 2025Updated 9 months ago
- A high content analysis method to study skeletal muscle☆12Apr 26, 2024Updated last year
- SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward☆91Aug 8, 2025Updated 6 months ago
- [ICCV 2025] Dynamic-VLM☆28Dec 16, 2024Updated last year
- Latest open-source "Thinking with images" (O3/O4-mini) papers, covering training-free, SFT-based, and RL-enhanced methods for "fine-grain…☆110Aug 21, 2025Updated 5 months ago
- A RLHF Infrastructure for Vision-Language Models☆196Nov 15, 2024Updated last year
- [NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient☆64Sep 27, 2025Updated 4 months ago
- [ICLR 2025 Spotlight] OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text☆412May 5, 2025Updated 9 months ago
- This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"☆74Apr 22, 2025Updated 9 months ago
- Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'’☆2,316Oct 29, 2025Updated 3 months ago
- ☆1,122Nov 20, 2025Updated 2 months ago
- Your efficient and accurate answer verification system for RL training.☆41Jun 23, 2025Updated 7 months ago
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]☆182Jun 5, 2025Updated 8 months ago
- VideoNSA: Native Sparse Attention Scales Video Understanding☆78Nov 16, 2025Updated 2 months ago
- Latest Advances on (RL based) Multimodal Reasoning and Generation in Multimodal Large Language Models☆47Oct 30, 2025Updated 3 months ago
- Official Code for All-in-One Medical Image Re-Identification (CVPR2025)☆18Jan 11, 2026Updated last month
- ☆97Jun 23, 2025Updated 7 months ago
- Source code and data used in the papers ViQuAE (Lerner et al., SIGIR'22), Multimodal ICT (Lerner et al., ECIR'23) and Cross-modal Retriev…☆38Dec 19, 2024Updated last year
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.☆89Feb 17, 2025Updated 11 months ago