SkyworkAI / Skywork-OR1
Unleashing the Power of Reinforcement Learning for Math and Code Reasoners
☆540 · Updated 2 weeks ago
Alternatives and similar repositories for Skywork-OR1:
Users who are interested in Skywork-OR1 compare it to the repositories listed below.
- Understanding R1-Zero-Like Training: A Critical Perspective ☆908 · Updated 3 weeks ago
- ☆739 · Updated 2 weeks ago
- ☆679 · Updated 3 weeks ago
- Muon is Scalable for LLM Training ☆1,039 · Updated last month
- Large Reasoning Models ☆804 · Updated 5 months ago
- ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates ☆382 · Updated last month
- Official Repo for Open-Reasoner-Zero ☆1,904 · Updated last month
- LIMO: Less is More for Reasoning ☆927 · Updated last month
- ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning ☆808 · Updated last week
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning ☆495 · Updated 2 weeks ago
- Explore the Multimodal “Aha Moment” on 2B Model ☆583 · Updated last month
- [ICML 2025 Spotlight] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction ☆515 · Updated 2 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments. ☆312 · Updated 3 weeks ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments. ☆1,698 · Updated this week
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL ☆373 · Updated last week
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning ☆441 · Updated 2 weeks ago
- An Open-source RL System from ByteDance Seed and Tsinghua AIR ☆1,198 · Updated 3 weeks ago
- ☆924 · Updated 3 months ago
- Scalable RL solution for advanced reasoning of language models ☆1,529 · Updated last month
- OLMoE: Open Mixture-of-Experts Language Models ☆739 · Updated last month
- AN O1 REPLICATION FOR CODING ☆333 · Updated 4 months ago
- ☆671 · Updated last week
- Distributed RL System for LLM Reasoning ☆1,205 · Updated last week
- A series of technical report on Slow Thinking with LLM ☆659 · Updated 3 weeks ago
- ☆287 · Updated last month
- ☆314 · Updated 7 months ago
- ☆524 · Updated 3 weeks ago
- Yi-1.5 is an upgraded version of Yi, delivering stronger performance in coding, math, reasoning, and instruction-following capability. ☆553 · Updated 5 months ago
- EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL ☆2,258 · Updated this week
- Collect every awesome work about r1! ☆356 · Updated last week