thinking-machines-lab / tinker-cookbook
Post-training with Tinker
☆2,719 Updated this week
Alternatives and similar repositories for tinker-cookbook
Users who are interested in tinker-cookbook compare it to the libraries listed below.
- Our library for RL environments + evals ☆3,730 Updated last week
- SkyRL: A Modular Full-stack RL Library for LLMs ☆1,456 Updated this week
- An interface library for RL post training with environments. ☆1,004 Updated this week
- Scalable toolkit for efficient model reinforcement ☆1,227 Updated this week
- Async RL Training at Scale ☆1,005 Updated this week
- OpenAI Frontier Evals ☆983 Updated last month
- ☆1,376 Updated 4 months ago
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering ☆1,276 Updated this week
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards ☆1,304 Updated last month
- A benchmark for LLMs on complicated tasks in the terminal ☆1,350 Updated 3 weeks ago
- A Lightweight LLM Post-Training Library ☆2,106 Updated this week
- Textbook on reinforcement learning from human feedback ☆1,416 Updated this week
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse … ☆825 Updated this week
- ☆949 Updated 2 months ago
- [NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution" ☆654 Updated 10 months ago
- dLLM: Simple Diffusion Language Modeling ☆1,566 Updated last week
- General plug-and-play inference library for Recursive Language Models (RLMs), supporting various sandboxes. ☆1,230 Updated this week
- Renderer for the harmony response format to be used with gpt-oss ☆4,135 Updated last month
- Recipes to scale inference-time compute of open models ☆1,123 Updated 7 months ago
- PyTorch-native post-training at scale ☆595 Updated this week
- ☆2,546 Updated this week
- Training Large Language Model to Reason in a Continuous Latent Space ☆1,449 Updated 5 months ago
- PyTorch building blocks for the OLMo ecosystem ☆681 Updated last week
- MLGym: A New Framework and Benchmark for Advancing AI Research Agents ☆584 Updated 5 months ago
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs" ☆576 Updated 3 months ago
- Optimize prompts, code, and more with AI-powered Reflective Text Evolution ☆2,078 Updated last week
- OpenTinker is an RL-as-a-Service infrastructure for foundation models ☆577 Updated this week
- A project to improve skills of large language models ☆756 Updated last week
- Open-source framework for the research and development of foundation models. ☆711 Updated this week
- Implementing DeepSeek R1's GRPO algorithm from scratch ☆1,740 Updated 9 months ago