SimpleBerry / LLaMA-O1
Large Reasoning Models
☆799Updated 3 months ago
Alternatives and similar repositories for LLaMA-O1:
Users that are interested in LLaMA-O1 are comparing it to the libraries listed below
- ☆905Updated last month
- A series of technical report on Slow Thinking with LLM☆536Updated this week
- Scalable RL solution for advanced reasoning of language models☆1,381Updated 3 weeks ago
- O1 Replication Journey☆1,969Updated last month
- OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models☆1,712Updated last month
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward☆841Updated 3 weeks ago
- ☆494Updated 2 months ago
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)☆589Updated last month
- OLMoE: Open Mixture-of-Experts Language Models☆666Updated 2 months ago
- An Open Large Reasoning Model for Real-World Solutions☆1,472Updated last week
- ☆1,340Updated 3 months ago
- [ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data …☆648Updated last month
- Recipes to scale inference-time compute of open models☆1,035Updated 2 weeks ago
- AN O1 REPLICATION FOR CODING☆329Updated 3 months ago
- ☆1,008Updated 2 months ago
- Recipes to train reward model for RLHF.☆1,237Updated last month
- Search-o1: Agentic Search-Enhanced Large Reasoning Models☆694Updated last week
- Code for Quiet-STaR☆721Updated 6 months ago
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL☆320Updated last month