thinking-machines-lab / tinker-cookbook
Post-training with Tinker
☆2,617 Updated this week
Alternatives and similar repositories for tinker-cookbook
Users who are interested in tinker-cookbook are comparing it to the libraries listed below.
- Our library for RL environments + evals ☆3,655 Updated this week
- Async RL Training at Scale ☆960 Updated this week
- OpenAI Frontier Evals ☆966 Updated 3 weeks ago
- An interface library for RL post training with environments. ☆859 Updated this week
- SkyRL: A Modular Full-stack RL Library for LLMs ☆1,407 Updated this week
- Scalable toolkit for efficient model reinforcement ☆1,171 Updated this week
- ☆941 Updated last month
- A benchmark for LLMs on complicated tasks in the terminal ☆1,260 Updated this week
- A Lightweight LLM Post-Training Library ☆2,055 Updated this week
- Textbook on reinforcement learning from human feedback ☆1,364 Updated last week
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering ☆1,236 Updated last week
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse … ☆780 Updated this week
- Training Large Language Model to Reason in a Continuous Latent Space ☆1,411 Updated 4 months ago
- ☆1,369 Updated 3 months ago
- ☆2,507 Updated this week
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards ☆1,283 Updated last week
- dLLM: Simple Diffusion Language Modeling ☆1,504 Updated this week
- Synthetic data curation for post-training and structured data extraction ☆1,586 Updated 4 months ago
- slime is an LLM post-training framework for RL Scaling. ☆3,022 Updated this week
- MLGym: A New Framework and Benchmark for Advancing AI Research Agents ☆581 Updated 4 months ago
- PyTorch-native post-training at scale ☆577 Updated this week
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs" ☆569 Updated 2 months ago
- Checkpoint-engine is a simple middleware to update model weights in LLM inference engines ☆871 Updated this week
- codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004) ☆710 Updated last week
- A project to improve skills of large language models ☆715 Updated this week
- Renderer for the harmony response format to be used with gpt-oss ☆4,096 Updated last week
- [COLM 2025] LIMO: Less is More for Reasoning ☆1,056 Updated 4 months ago
- Self-Adapting Language Models ☆1,620 Updated 4 months ago
- bloom - evaluate any behavior immediately 🌸🌱 ☆640 Updated this week
- Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents ☆1,772 Updated 4 months ago