kyegomez / OpenR1Links
An open source implementation of R1
☆29Updated this week
Alternatives and similar repositories for OpenR1
Users that are interested in OpenR1 are comparing it to the libraries listed below
Sorting:
- ☆96Updated last year
- ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization☆95Updated 8 months ago
- ☆46Updated 8 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated last year
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性 内容是AGI/ASI的核心。☆45Updated last year
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆62Updated 7 months ago
- ☆84Updated last year
- [ACL 2025] Agentic Knowledgeable Self-awareness☆91Updated 7 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆143Updated last year
- [ICLR 2026] Efficient Agent Training for Computer Use☆135Updated 5 months ago
- FuseAI Project☆87Updated last year
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆125Updated 8 months ago
- [ICML 2025] |TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation☆121Updated 8 months ago
- ☆84Updated last year
- [NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning☆115Updated last month
- This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems☆95Updated 2 months ago
- Extensive Self-Contrast Enables Feedback-Free Language Model Alignment☆21Updated last year
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆40Updated 2 years ago
- ☆104Updated last year
- [NeurIPS 2024] Personal Agentic AI for MultiAgent Cooperation☆87Updated last year
- ☆93Updated 8 months ago
- ☆63Updated last year
- ☆67Updated 10 months ago
- rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking☆39Updated last year
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- [ICML 2025] Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search☆108Updated 8 months ago
- DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking☆49Updated last month
- Code repo for MathAgent☆19Updated 2 years ago
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"☆60Updated last year
- ☆32Updated 8 months ago