thinking-machines-lab / tinker-cookbookLinks
Post-training with Tinker
☆1,455Updated this week
Alternatives and similar repositories for tinker-cookbook
Users that are interested in tinker-cookbook are comparing it to the libraries listed below
Sorting:
- Scalable toolkit for efficient model reinforcement☆1,009Updated this week
- SkyRL: A Modular Full-stack RL Library for LLMs☆1,170Updated this week
- OpenAI Frontier Evals☆937Updated last week
- ☆894Updated last week
- ☆1,335Updated 2 months ago
- Environments for LLM Reinforcement Learning☆3,475Updated this week
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards☆1,214Updated last month
- Async RL Training at Scale☆749Updated this week
- Checkpoint-engine is a simple middleware to update model weights in LLM inference engines☆820Updated this week
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering☆1,144Updated last week
- An interface library for RL post training with environments.☆687Updated this week
- PyTorch-native post-training at scale☆509Updated this week
- A JAX-native LLM Post-Training Library☆1,801Updated this week
- A benchmark for LLMs on complicated tasks in the terminal☆1,041Updated this week
- MLGym A New Framework and Benchmark for Advancing AI Research Agents☆568Updated 3 months ago
- ☆1,073Updated last week
- Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.☆704Updated last month
- [NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆616Updated 7 months ago
- codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)☆667Updated 2 weeks ago
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆743Updated this week
- Textbook on reinforcement learning from human feedback☆1,295Updated this week
- Muon is Scalable for LLM Training☆1,354Updated 3 months ago
- Training Large Language Model to Reason in a Continuous Latent Space☆1,327Updated 3 months ago
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"☆556Updated last month
- Dream 7B, a large diffusion language model☆1,054Updated last month
- OLMoE: Open Mixture-of-Experts Language Models☆899Updated last month
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]☆573Updated 3 months ago
- Pretraining and inference code for a large-scale depth-recurrent language model☆843Updated 3 weeks ago
- slime is an LLM post-training framework for RL Scaling.☆2,407Updated last week
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆913Updated 5 months ago