THUDM / slimeLinks
slime is a LLM post-training framework for RL Scaling.
☆1,652Updated this week
Alternatives and similar repositories for slime
Users that are interested in slime are comparing it to the libraries listed below
Sorting:
- SkyRL: A Modular Full-stack RL Library for LLMs☆818Updated this week
- An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models☆1,888Updated last week
- Official Repo for Open-Reasoner-Zero☆2,033Updated 3 months ago
- Understanding R1-Zero-Like Training: A Critical Perspective☆1,076Updated 2 weeks ago
- ☆921Updated 2 months ago
- An Open-source RL System from ByteDance Seed and Tsinghua AIR☆1,537Updated 4 months ago
- Muon is Scalable for LLM Training☆1,302Updated last month
- ☆812Updated 3 months ago
- Ring attention implementation with flash attention☆864Updated last month
- Scalable toolkit for efficient model reinforcement☆843Updated this week
- Distributed RL System for LLM Reasoning☆2,569Updated this week
- Scalable RL solution for advanced reasoning of language models☆1,716Updated 5 months ago
- Fast inference from large lauguage models via speculative decoding☆814Updated last year
- Large Reasoning Models☆805Updated 9 months ago
- [NeurIPS'24 Spotlight, ICLR'25, ICML'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention…☆1,124Updated last month
- OLMoE: Open Mixture-of-Experts Language Models☆857Updated 5 months ago
- Scalable toolkit for efficient model alignment☆839Updated last month
- 🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"☆847Updated 5 months ago
- Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3.☆1,635Updated this week
- Super-Efficient RLHF Training of LLMs with Parameter Reallocation☆313Updated 4 months ago
- [COLM 2025] LIMO: Less is More for Reasoning☆1,015Updated last month
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.☆2,279Updated this week
- Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling☆438Updated 3 months ago
- ☆958Updated 7 months ago
- ☆741Updated last week
- Fast, Flexible and Portable Structured Generation☆1,215Updated this week
- Unleashing the Power of Reinforcement Learning for Math and Code Reasoners☆708Updated 3 months ago
- Materials for learning SGLang☆562Updated last week
- Recipes to scale inference-time compute of open models☆1,111Updated 3 months ago
- VeOmni: Scaling any Modality Model Training to any Accelerators with PyTorch native Training Framework☆1,050Updated 2 weeks ago