hughbzhang / o1_inference_scaling_laws

Replicating o1 inference-time scaling laws

☆90

Alternatives and similar repositories for o1_inference_scaling_laws

Users who are interested in o1_inference_scaling_laws are comparing it to the repositories listed below

JacobPfau / fillerTokens
☆74 Updated last year
casmlab / NPHardEval
Repository for NPHardEval, a quantified-dynamic benchmark of LLMs
☆59 Updated last year
SalesforceAIResearch / LaTRO
☆122 Updated 8 months ago
mlfoundations / scaling
Language models scale reliably with over-training and on downstream tasks
☆100 Updated last year
allenai / easy-to-hard-generalization
Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"
☆48 Updated last year
ScalingIntelligence / large_language_monkeys
☆108 Updated last year
SynthLabsAI / big-math
A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models
☆65 Updated 8 months ago
architsharma97 / dpo-rlaif
☆100 Updated last year
da03 / Internalize_CoT_Step_by_Step
☆195 Updated 6 months ago
ahans30 / goldfish-loss
[NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs
☆92 Updated 11 months ago
sail-sg / SkyLadder
The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling
☆35 Updated 2 weeks ago
TIGER-AI-Lab / LongICLBench
Code and Data for "Long-context LLMs Struggle with Long In-context Learning" [TMLR2025]
☆109 Updated 8 months ago
Yu-Fangxu / FoR
[ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples
☆108 Updated 3 months ago
Leooyii / LCEG
Long Context Extension and Generalization in LLMs
☆62 Updated last year
kyegomez / Lets-Verify-Step-by-Step
"Improving Mathematical Reasoning with Process Supervision" by OpenAI
☆111 Updated last week
huggingface / ioi
☆40 Updated 7 months ago
da03 / implicit_chain_of_thought
☆138 Updated 11 months ago
katiekang1998 / reasoning_generalization
☆33 Updated 9 months ago
hamishivi / automated-instruction-selection
Exploration of automated dataset selection approaches at large scales.
☆48 Updated 7 months ago
hkust-nlp / PreSelect
[ICML 2025] Predictive Data Selection: The Data That Predicts Is the Data That Teaches
☆56 Updated 7 months ago
RobertCsordas / moeut
☆86 Updated last year
hkust-nlp / llm-compression-intelligence
Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]
☆142 Updated last year
hamishivi / EasyLM
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…
☆75 Updated last year
princeton-nlp / USACO
Can Language Models Solve Olympiad Programming?
☆119 Updated 9 months ago
GAIR-NLP / AIME-Preview
☆75 Updated 7 months ago
TIGER-AI-Lab / CritiqueFineTuning
Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025]
☆178 Updated 3 months ago
martin-wey / CodeUltraFeedback
CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025)
☆71 Updated last year
THUDM / T1
RL Scaling and Test-Time Scaling (ICML'25)
☆111 Updated 9 months ago
google-deepmind / bbeh
☆99 Updated 5 months ago
efficientscaling / Z1
[EMNLP'2025 Industry] Repo for "Z1: Efficient Test-time Scaling with Code"
☆66 Updated 6 months ago