thinking-machines-lab/tinker-cookbook

Post-training with Tinker

☆3,869

Alternatives and similar repositories for tinker-cookbook

Users who are interested in tinker-cookbook are comparing it to the libraries listed below.

thinking-machines-lab / tinker
Training API and CLI
☆637 · Updated this week
NovaSky-AI / SkyRL
SkyRL: A Modular Full-stack RL Library for LLMs
☆2,085 · Updated this week
THUDM / slime
slime is an LLM post-training framework for RL Scaling.
☆7,569 · Updated this week
verl-project / verl
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
☆22,587 · Updated this week
PrimeIntellect-ai / verifiers
Our library for RL environments + evals
☆4,390 · Updated this week
thinking-machines-lab / tinker-project-ideas
Ideas for projects related to Tinker
☆191 · Nov 6, 2025 · Updated 8 months ago
PrimeIntellect-ai / prime-rl
Agentic RL Training at Scale
☆1,702 · Updated this week
rllm-org / rllm
Democratizing Reinforcement Learning for LLMs
☆5,708 · Updated this week
radixark / miles
Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.
☆1,761 · Updated this week
areal-project / AReaL
The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible.
☆5,579 · Updated this week
thinking-machines-lab / batch_invariant_ops
☆1,046 · Nov 4, 2025 · Updated 8 months ago
open-tinker / OpenTinker
OpenTinker is an RL-as-a-Service infrastructure for foundation models
☆676 · Mar 21, 2026 · Updated 4 months ago
NVIDIA-NeMo / RL
Scalable toolkit for efficient model reinforcement
☆1,840 · Updated this week
sgl-project / sglang
SGLang is a high-performance serving framework for large language models and multimodal models.
☆30,583 · Updated this week
OpenPipe / ART
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement…
☆10,505 · Updated this week
OpenRLHF / OpenRLHF
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Asy…
☆9,831 · Jul 14, 2026 · Updated last week
meta-pytorch / torchforge
PyTorch-native post-training at scale
☆696 · Updated this week
PeterGriffinJin / Search-R1
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
☆5,130 · Nov 13, 2025 · Updated 8 months ago
huggingface / trl
Train transformer language models with reinforcement learning.
☆18,898 · Updated this week
karpathy / nanochat
The best ChatGPT that $100 can buy.
☆56,507 · Jul 4, 2026 · Updated 2 weeks ago
lasgroup / SDPO
Reinforcement Learning via Self-Distillation (SDPO)
☆1,017 · Jul 1, 2026 · Updated 3 weeks ago
allenai / open-instruct
AllenAI's post-training codebase
☆3,803 · Updated this week
openai / harmony
Renderer for the harmony response format to be used with gpt-oss
☆4,458 · Apr 8, 2026 · Updated 3 months ago
EleutherAI / lm-evaluation-harness
A framework for few-shot evaluation of language models.
☆13,359 · Jul 13, 2026 · Updated last week
mll-lab-nu / RAGEN
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
☆2,753 · Apr 14, 2026 · Updated 3 months ago
vllm-project / vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
☆86,804 · Updated this week
huggingface / OpenEnv
An interface library for RL post training with environments.
☆2,439 · Updated this week
huggingface / Math-Verify
☆1,170 · Jan 10, 2026 · Updated 6 months ago
KellerJordan / modded-nanogpt
NanoGPT (124M) in 90 seconds
☆5,548 · Jul 3, 2026 · Updated 2 weeks ago
unslothai / unsloth
Unsloth is a local UI for training and running Gemma 4, Qwen3.6, DeepSeek, Kimi, GLM and other models.
☆68,666 · Updated this week
alibaba / ROLL
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
☆3,313 · Updated this week
GeeeekExplorer / nano-vllm
Nano vLLM
☆14,582 · Apr 26, 2026 · Updated 2 months ago
sgl-project / mini-sglang
A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.
☆4,616 · May 17, 2026 · Updated 2 months ago
harbor-framework / harbor
Framework for evaluating and improving agents
☆3,348 · Updated this week
Jiayi-Pan / TinyZero
Minimal reproduction of DeepSeek R1-Zero
☆13,197 · Feb 27, 2026 · Updated 4 months ago
MoonshotAI / checkpoint-engine
Checkpoint-engine is a simple middleware to update model weights in LLM inference engines
☆972 · Jul 4, 2026 · Updated 2 weeks ago
open-thought / reasoning-gym
[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards
☆1,463 · Apr 17, 2026 · Updated 3 months ago
huggingface / open-r1
Fully open reproduction of DeepSeek-R1
☆26,414 · Apr 2, 2026 · Updated 3 months ago
google / tunix
A Lightweight LLM Post-Training Library
☆2,378 · Updated this week