ezelikman/quiet-star

Code for Quiet-STaR

☆739

Alternatives and similar repositories for quiet-star

Users who are interested in quiet-star are comparing it to the libraries listed below.

expz / quiet-star
Implementation of the Quiet-STaR paper (https://arxiv.org/pdf/2403.09629.pdf)
☆57 · Aug 8, 2024 · Updated last year
ezelikman / STaR
Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)
☆229 · Feb 21, 2023 · Updated 3 years ago
THUDM / ReST-MCTS
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
☆709 · Jan 20, 2025 · Updated last year
GAIR-NLP / O1-Journey
O1 Replication Journey
☆2,001 · Jan 14, 2025 · Updated last year
MARIO-Math-Reasoning / Super_MARIO
☆341 · Jun 5, 2025 · Updated last year
zhentingqi / rStar
☆972 · Jan 23, 2025 · Updated last year
openreasoner / openr
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
☆1,848 · Jan 17, 2025 · Updated last year
trotsky1997 / MathBlackBox
☆1,033 · Dec 17, 2024 · Updated last year
RUCAIBox / Slow_Thinking_with_LLMs
A series of technical reports on Slow Thinking with LLMs
☆767 · Aug 13, 2025 · Updated 11 months ago
microsoft / rho
Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.
☆471 · Apr 18, 2024 · Updated 2 years ago
uclaml / SPIN
The official implementation of Self-Play Fine-Tuning (SPIN)
☆1,248 · May 8, 2024 · Updated 2 years ago
YuxiXie / MCTS-DPO
This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.
☆331 · Jan 29, 2026 · Updated 6 months ago
hkust-nlp / simpleRL-reason
Simple RL training for reasoning
☆3,871 · Dec 23, 2025 · Updated 7 months ago
Open-Reasoner-Zero / Open-Reasoner-Zero
Official Repo for Open-Reasoner-Zero
☆2,096 · Jun 2, 2025 · Updated last year
lqtrung1998 / mwp_ReFT
☆554 · Jan 2, 2025 · Updated last year
hijkzzz / Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
☆6,893 · Dec 17, 2025 · Updated 7 months ago
SimpleBerry / LLaMA-O1
Large Reasoning Models
☆803 · Dec 3, 2024 · Updated last year
PRIME-RL / PRIME
Scalable RL solution for advanced reasoning of language models
☆1,866 · Mar 18, 2025 · Updated last year
JIA-Lab-research / Step-DPO
Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"
☆398 · Jan 19, 2025 · Updated last year
kanishkg / stream-of-search
Repository for the paper Stream of Search: Learning to Search in Language
☆154 · Feb 3, 2025 · Updated last year
OpenBMB / Eurus
☆322 · Sep 18, 2024 · Updated last year
maitrix-org / llm-reasoners
A library for advanced large language model reasoning
☆2,341 · Jun 10, 2025 · Updated last year
ytyz1307zzh / RefAug
Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"
☆55 · Oct 1, 2024 · Updated last year
da03 / Internalize_CoT_Step_by_Step
☆209 · Apr 19, 2025 · Updated last year
OpenRLHF / OpenRLHF
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Asy…
☆9,855 · Jul 14, 2026 · Updated 2 weeks ago
allenai / open-instruct
AllenAI's post-training codebase
☆3,811 · Updated this week
seal-rg / recurrent-pretraining
Pretraining and inference code for a large-scale depth-recurrent language model
☆903 · Dec 29, 2025 · Updated 7 months ago
Open-Source-O1 / Open-O1
☆1,340 · Nov 21, 2024 · Updated last year
ATH-MaaS / Marco-o1
An Open Large Reasoning Model for Real-World Solutions
☆1,537 · Jun 17, 2026 · Updated last month
arcee-ai / mergekit
Tools for merging pretrained large language models.
☆7,266 · Jun 17, 2026 · Updated last month
hbin0701 / Self-Explore
[EMNLP Findings 2024 & ACL 2024 NLRSE Oral] Enhancing Mathematical Reasonin…
☆52 · May 4, 2024 · Updated 2 years ago
huggingface / trl
Train transformer language models with reinforcement learning.
☆18,953 · Updated this week
YangLing0818 / buffer-of-thought-llm
[NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models
☆677 · Jun 28, 2025 · Updated last year
princeton-nlp / SimPO
[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward
☆956 · Feb 16, 2025 · Updated last year
openai / prm800k
800,000 step-level correctness labels on LLM solutions to MATH problems
☆2,152 · Jun 1, 2023 · Updated 3 years ago
huggingface / Math-Verify
☆1,172 · Jan 10, 2026 · Updated 6 months ago
srush / awesome-o1
A bibliography and survey of the papers surrounding o1
☆1,214 · Jul 7, 2026 · Updated 3 weeks ago
SalesforceAIResearch / LaTRO
☆127 · Jun 2, 2026 · Updated last month
RLHFlow / RLHF-Reward-Modeling
Recipes to train reward models for RLHF.
☆1,535 · Apr 24, 2025 · Updated last year