hiyouga / EasyR1
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
☆2,315Updated this week
Alternatives and similar repositories for EasyR1:
Users that are interested in EasyR1 are comparing it to the libraries listed below
- Reproduce R1 Zero on Logic Puzzle☆2,331Updated last month
- A fork to add multimodal model training to open-r1☆1,252Updated 3 months ago
- Official Repo for Open-Reasoner-Zero☆1,912Updated last month
- O1 Replication Journey☆1,987Updated 3 months ago
- A very simple GRPO implement for reproducing r1-like LLM thinking.☆1,029Updated last month
- OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models☆1,765Updated 3 months ago
- Simple RL training for reasoning☆3,540Updated last month
- Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.☆748Updated last week
- An Open-source RL System from ByteDance Seed and Tsinghua AIR☆1,219Updated last month
- Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks☆2,320Updated this week
- Latest Advances on System-2 Reasoning☆989Updated 2 weeks ago
- ☆684Updated 3 weeks ago
- Distributed RL System for LLM Reasoning☆1,229Updated 2 weeks ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.☆1,772Updated this week
- Scalable RL solution for advanced reasoning of language models☆1,537Updated last month
- Official repository of ’Visual-RFT: Visual Reinforcement Fine-Tuning’☆1,648Updated 3 weeks ago
- An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & LoRA & vLLM & RFT)☆6,595Updated this week
- MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning☆592Updated last week
- Large Reasoning Models☆804Updated 5 months ago
- Awesome RL Reasoning Recipes ("Triple R")☆520Updated this week
- Explore the Multimodal “Aha Moment” on 2B Model☆585Updated last month
- This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-bas…☆740Updated this week
- Witness the aha moment of VLM with less than $3.☆3,642Updated 2 months ago
- Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities☆829Updated 3 weeks ago
- R1-onevision, a visual language model capable of deep CoT reasoning.☆514Updated 3 weeks ago
- ☆526Updated 4 months ago
- A Framework of Small-scale Large Multimodal Models☆812Updated 2 weeks ago
- This is the first paper to explore how to effectively use RL for MLLMs and introduce Vision-R1, a reasoning MLLM that leverages cold-sta…☆550Updated this week
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, GLM4, Mistral, Yi1.5, DeepSeek-R1, ...) an…☆7,450Updated this week
- MoBA: Mixture of Block Attention for Long-Context LLMs☆1,771Updated last month