thinking-machines-lab / tinker-cookbookLinks
Post-training with Tinker
☆550Updated this week
Alternatives and similar repositories for tinker-cookbook
Users that are interested in tinker-cookbook are comparing it to the libraries listed below
Sorting:
- Async RL Training at Scale☆650Updated last week
- rl from zero pretrain, can it be done? yes.☆274Updated this week
- ☆773Updated 3 weeks ago
- Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike stat…☆263Updated last week
- Simple & Scalable Pretraining for Neural Architecture Research☆296Updated last month
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"☆328Updated 10 months ago
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]☆547Updated 2 months ago
- A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning☆281Updated this week
- 🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.☆472Updated 2 weeks ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆343Updated 9 months ago
- MLGym A New Framework and Benchmark for Advancing AI Research Agents☆557Updated last month
- Open-source framework for the research and development of foundation models.☆462Updated this week
- Code for the paper: "Learning to Reason without External Rewards"☆357Updated 2 months ago
- ☆225Updated 3 months ago
- Open source interpretability artefacts for R1.☆160Updated 5 months ago
- Training-Ready RL Environments + Evals☆111Updated last week
- Checkpoint-engine is a simple middleware to update model weights in LLM inference engines☆751Updated this week
- Testing baseline LLMs performance across various models☆310Updated last month
- SkyRL: A Modular Full-stack RL Library for LLMs☆906Updated this week
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆701Updated this week
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"☆535Updated 2 months ago
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards☆1,161Updated this week
- FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference.☆280Updated last month
- A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).☆287Updated last week
- ☆476Updated 2 months ago
- ☆103Updated 2 weeks ago
- [NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents☆407Updated last week
- Build your own visual reasoning model☆409Updated last month
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache☆125Updated last month
- Exploring Applications of GRPO☆250Updated last month