srush / awesome-o1Links

A bibliography and survey of the papers surrounding o1

☆1,207

Alternatives and similar repositories for awesome-o1

Users that are interested in awesome-o1 are comparing it to the libraries listed below

Sorting:

huggingface / search-and-learn
Recipes to scale inference-time compute of open models
☆1,110Updated 2 months ago
open-thought / reasoning-gym
procedural reasoning datasets
☆1,012Updated this week
facebookresearch / coconut
Training Large Language Model to Reason in a Continuous Latent Space
☆1,224Updated 6 months ago
sail-sg / understand-r1-zero
Understanding R1-Zero-Like Training: A Critical Perspective
☆1,055Updated last week
ContextualAI / HALOs
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
☆877Updated 3 weeks ago
huggingface / Math-Verify
☆870Updated last month
NovaSky-AI / SkyRL
SkyRL: A Modular Full-stack RL Library for LLMs
☆679Updated this week
allenai / reward-bench
RewardBench: the first evaluation tool for reward models.
☆619Updated last month
open-thought / system-2-research
System 2 Reasoning Link Collection
☆849Updated 4 months ago
sail-sg / oat
🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.
☆425Updated last week
trotsky1997 / MathBlackBox
☆1,028Updated 7 months ago
McGill-NLP / nano-aha-moment
Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"
☆512Updated 3 weeks ago
NVIDIA / NeMo-Aligner
Scalable toolkit for efficient model alignment
☆833Updated last week
mlfoundations / evalchemy
Automatic evals for LLMs
☆496Updated last month
microsoft / rStar
☆608Updated 3 weeks ago
willccbb / verifiers
Verifiers for LLM Reinforcement Learning
☆1,690Updated this week
princeton-nlp / SimPO
[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward
☆912Updated 5 months ago
GAIR-NLP / LIMO
[COLM 2025] LIMO: Less is More for Reasoning
☆993Updated last week
SimpleBerry / LLaMA-O1
Large Reasoning Models
☆804Updated 8 months ago
andyzoujm / representation-engineering
Representation Engineering: A Top-Down Approach to AI Transparency
☆854Updated 11 months ago
jzhang38 / EasyContext
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
☆739Updated 10 months ago
huggingface / lighteval
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
☆1,793Updated this week
huggingface / nanotron
Minimalistic large language model 3D-parallelism training
☆2,101Updated 3 weeks ago
seal-rg / recurrent-pretraining
Pretraining and inference code for a large-scale depth-recurrent language model
☆808Updated 2 weeks ago
zhentingqi / rStar
☆954Updated 6 months ago
THUDM / ReST-MCTS
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
☆654Updated 6 months ago
GAIR-NLP / O1-Journey
O1 Replication Journey
☆1,998Updated 6 months ago
NVIDIA / NeMo-Skills
A project to improve skills of large language models
☆501Updated this week
openai / sparse_autoencoder
☆505Updated last year
XueFuzhao / OpenMoE
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
☆1,571Updated last year