thinking-machines-lab / tinker-cookbook
Post-training with Tinker
☆2,313 Updated this week
Alternatives and similar repositories for tinker-cookbook
Users who are interested in tinker-cookbook are comparing it to the libraries listed below.
- Environments for LLM Reinforcement Learning ☆3,573 Updated this week
- SkyRL: A Modular Full-stack RL Library for LLMs ☆1,287 Updated last week
- OpenAI Frontier Evals ☆951 Updated this week
- Async RL Training at Scale ☆867 Updated this week
- ☆917 Updated last month
- Scalable toolkit for efficient model reinforcement ☆1,054 Updated this week
- An interface library for RL post training with environments. ☆789 Updated this week
- ☆1,351 Updated 2 months ago
- A benchmark for LLMs on complicated tasks in the terminal ☆1,137 Updated last week
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards ☆1,242 Updated 3 weeks ago
- Training Large Language Model to Reason in a Continuous Latent Space ☆1,367 Updated 3 months ago
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering ☆1,199 Updated last week
- Recipes to scale inference-time compute of open models ☆1,118 Updated 6 months ago
- slime is an LLM post-training framework for RL Scaling. ☆2,612 Updated last week
- Pretraining and inference code for a large-scale depth-recurrent language model ☆850 Updated last month
- ☆2,467 Updated last month
- [NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution" ☆624 Updated 8 months ago
- Checkpoint-engine is a simple middleware to update model weights in LLM inference engines ☆851 Updated last week
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse … ☆760 Updated this week
- PyTorch-native post-training at scale ☆549 Updated last week
- Synthetic data curation for post-training and structured data extraction ☆1,564 Updated 4 months ago
- Renderer for the harmony response format to be used with gpt-oss ☆4,033 Updated last month
- A JAX-native LLM Post-Training Library ☆1,951 Updated this week
- Understanding R1-Zero-Like Training: A Critical Perspective ☆1,164 Updated 3 months ago
- Muon is Scalable for LLM Training ☆1,372 Updated 4 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends ☆2,162 Updated this week
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time! ☆1,172 Updated 10 months ago
- Minimalistic 4D-parallelism distributed training framework for education purpose ☆1,911 Updated 3 months ago
- Textbook on reinforcement learning from human feedback ☆1,338 Updated last week
- τ²-Bench: Evaluating Conversational Agents in a Dual-Control Environment ☆503 Updated this week