THUDM / slimeLinks

slime is an LLM post-training framework for RL Scaling.

☆2,612

Alternatives and similar repositories for slime

Users that are interested in slime are comparing it to the libraries listed below

Sorting:

alibaba / ROLL
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
☆2,414Updated this week
Open-Reasoner-Zero / Open-Reasoner-Zero
Official Repo for Open-Reasoner-Zero
☆2,069Updated 6 months ago
NovaSky-AI / SkyRL
SkyRL: A Modular Full-stack RL Library for LLMs
☆1,287Updated this week
BytedTsinghua-SIA / DAPO
An Open-source RL System from ByteDance Seed and Tsinghua AIR
☆1,653Updated 6 months ago
inclusionAI / AReaL
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
☆3,101Updated this week
NVIDIA-NeMo / RL
Scalable toolkit for efficient model reinforcement
☆1,048Updated this week
openreasoner / openr
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
☆1,826Updated 10 months ago
PRIME-RL / PRIME
Scalable RL solution for advanced reasoning of language models
☆1,779Updated 8 months ago
MoonshotAI / Moonlight
Muon is Scalable for LLM Training
☆1,372Updated 4 months ago
huggingface / Math-Verify
☆1,015Updated 5 months ago
sail-sg / understand-r1-zero
Understanding R1-Zero-Like Training: A Critical Perspective
☆1,164Updated 3 months ago
fla-org / native-sparse-attention
🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"
☆928Updated 8 months ago
ByteDance-Seed / VeOmni
VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo
☆1,338Updated last week
ByteDance-Seed / Seed-Thinking-v1.5
☆819Updated 5 months ago
mll-lab-nu / RAGEN
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
☆2,430Updated this week
MoonshotAI / MoBA
MoBA: Mixture of Block Attention for Long-Context LLMs
☆2,007Updated 8 months ago
microsoft / rStar
☆1,351Updated 2 months ago
zhaochenyang20 / Awesome-ML-SYS-Tutorial
My learning notes/codes for ML SYS.
☆4,272Updated last week
GAIR-NLP / O1-Journey
O1 Replication Journey
☆2,002Updated 10 months ago
alibaba / Pai-Megatron-Patch
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
☆1,445Updated 2 weeks ago
SimpleBerry / LLaMA-O1
Large Reasoning Models
☆807Updated last year
zhuzilin / ring-flash-attention
Ring attention implementation with flash attention
☆923Updated 2 months ago
mlc-ai / xgrammar
Fast, Flexible and Portable Structured Generation
☆1,396Updated this week
TsinghuaC3I / Awesome-RL-for-LRMs
A Survey of Reinforcement Learning for Large Reasoning Models
☆2,088Updated 3 weeks ago
allenai / OLMoE
OLMoE: Open Mixture-of-Experts Language Models
☆916Updated 2 months ago
PeterGriffinJin / Search-R1
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
☆3,580Updated 2 weeks ago
hkust-nlp / simpleRL-reason
Simple RL training for reasoning
☆3,796Updated 4 months ago
Qihoo360 / Light-R1
☆751Updated 3 months ago
sgl-project / sgl-learning-materials
Materials for learning SGLang
☆658Updated last week
SkyworkAI / Skywork-OR1
Unleashing the Power of Reinforcement Learning for Math and Code Reasoners
☆731Updated 5 months ago