tanhuajie/Reason-RFT

[NeurIPS 2025]⭐️ Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning.

☆440

Alternatives and similar repositories for Reason-RFT

Users who are interested in Reason-RFT are comparing it to the repositories listed below.

uclanlp / OpenVLThinker
OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement
☆152 · May 25, 2026 · Updated last month
Sun-Haoyuan23 / Awesome-RL-based-Reasoning-MLLMs
This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-bas…
☆1,426 · May 11, 2026 · Updated last month
real-absolute-AI / NoisyRollout
[NeurIPS 2025] NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation
☆110 · Sep 18, 2025 · Updated 9 months ago
EmbodiedCity / Embodied-R.code
☆95 · May 15, 2025 · Updated last year
FlagOpen / ShareRobot
☆62 · Apr 1, 2025 · Updated last year
AMAP-ML / GPG
[ICLR26] GPG: A Simple and Strong Reinforcement Learning Baseline for Model Reasoning
☆180 · Jan 29, 2026 · Updated 5 months ago
FlagOpen / RoboBrain
[CVPR 2025] RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete. Official Repository.
☆554 · Oct 13, 2025 · Updated 8 months ago
om-ai-lab / VLM-R1
Solve Visual Understanding with Reinforced VLMs
☆5,991 · Mar 12, 2026 · Updated 3 months ago
Osilly / Vision-R1
[ICLR2026] This is the first paper to explore how to effectively use R1-like RL for MLLMs and introduce Vision-R1, a reasoning MLLM that…
☆1,474 · Mar 20, 2026 · Updated 3 months ago
EvolvingLMMs-Lab / open-r1-multimodal
A fork to add multimodal model training to open-r1
☆1,576 · Feb 8, 2025 · Updated last year
VLM-RL / Ocean-R1
☆26 · Apr 9, 2025 · Updated last year
Liuziyu77 / Visual-RFT
Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'
☆2,250 · Oct 29, 2025 · Updated 8 months ago
hq-King / Affordance-R1
code for affordance-r1
☆73 · May 11, 2026 · Updated last month
Fancy-MLLM / R1-Onevision
R1-onevision, a visual language model capable of deep CoT reasoning.
☆581 · Apr 13, 2025 · Updated last year
TIGER-AI-Lab / VL-Rethinker
The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]
☆190 · Jun 5, 2025 · Updated last year
ModalMinds / MM-EUREKA
MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning
☆771 · Sep 7, 2025 · Updated 9 months ago
cheliu-computation / AlphaMed-NeurIPSW
Unleashing Reasoning in Medical Large Language Models
☆12 · Mar 19, 2025 · Updated last year
jinpeng0528 / SEFE
☆13 · May 6, 2025 · Updated last year
TideDra / lmm-r1
Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.
☆846 · May 14, 2025 · Updated last year
zwq2018 / embodied_reasoner
Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks
☆201 · Apr 9, 2026 · Updated 2 months ago
FlagOpen / RoboBrain-X0
☆116 · Oct 27, 2025 · Updated 8 months ago
shengjun-zhang / VisualGRPO
E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Models
☆43 · Jan 5, 2026 · Updated 5 months ago
deepglint / MLCD-Seg
MLCD-Seg is a zero-shot segmentation model from DeepGlint.
☆18 · Jul 4, 2025 · Updated 11 months ago
turningpoint-ai / VisualThinker-R1-Zero
Explore the Multimodal “Aha Moment” on 2B Model
☆623 · Mar 18, 2025 · Updated last year
OpenDCAI / Awesome_MLLMs_Reasoning
☆110 · Sep 11, 2025 · Updated 9 months ago
StarsfieldAI / R1-V
Witness the aha moment of VLM with less than $3.
☆4,060 · May 19, 2025 · Updated last year
maifoundations / Streamo
Streaming Video Instruction Tuning
☆76 · Feb 25, 2026 · Updated 4 months ago
mc-lan / ClearCLIP
[ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference
☆100 · Mar 26, 2025 · Updated last year
kxfan2002 / SophiaVL-R1
SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward
☆95 · Aug 8, 2025 · Updated 10 months ago
minglllli / CLS-RL
[NeurIPS 2025 Spotlight] Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-Tuning
☆87 · Sep 19, 2025 · Updated 9 months ago
HKUST-LongGroup / Relation-R1
[AAAI 2026] Relation-R1: Progressively Cognitive Chain-of-Thought Guided Reinforcement Learning for Unified Relation Comprehension
☆21 · Mar 6, 2026 · Updated 3 months ago
UCSC-VLAA / VLAA-Thinking
[TMLR 25] SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models
☆149 · Oct 10, 2025 · Updated 8 months ago
LengSicong / MMR1
[CVPR 2026] MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources
☆218 · Sep 26, 2025 · Updated 9 months ago
MikeWangWZHL / PAPO
Official repo for "PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning"
☆148 · Feb 4, 2026 · Updated 4 months ago
songw-zju / PixelThink
The official implementation of "PixelThink: Towards Efficient Chain-of-Pixel Reasoning" (ICML 2026)
☆43 · Updated this week
tulerfeng / Video-R1
Video-R1: Reinforcing Video Reasoning in MLLMs [🔥the first paper to explore R1 for video]
☆878 · Dec 14, 2025 · Updated 6 months ago
yuhui-zh15 / AutoConverter
Official implementation of "Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation" (CVPR 202…
☆40 · May 26, 2025 · Updated last year
MiliLab / AnesSuite
Official repo for [ICLR 2026] "AnesSuite: A Comprehensive Benchmark and Dataset Suite for Anesthesiology Reasoning in LLMs"
☆25 · Feb 28, 2026 · Updated 4 months ago
PKU-HMI-Lab / Hybrid-VLA
HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model
☆351 · Oct 3, 2025 · Updated 8 months ago