thinking-machines-lab / tinkerLinks
Training API and CLI
☆323Updated this week
Alternatives and similar repositories for tinker
Users that are interested in tinker are comparing it to the libraries listed below
Sorting:
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates.☆357Updated this week
- PyTorch-native post-training at scale☆600Updated last week
- ☆93Updated last week
- ☆952Updated 2 months ago
- A Gym for Agentic LLMs☆437Updated last week
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"☆343Updated 2 months ago
- Open-source framework for the research and development of foundation models.☆731Updated last week
- Simple & Scalable Pretraining for Neural Architecture Research☆307Updated last month
- A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).☆344Updated last month
- rl from zero pretrain, can it be done? yes.☆286Updated 4 months ago
- Curated collection of community environments☆208Updated this week
- FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference.☆333Updated 2 months ago
- This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"☆288Updated 2 months ago
- Async RL Training at Scale☆1,020Updated last week
- [ICLR 2026] Official PyTorch Implementation of RLP: Reinforcement as a Pretraining Objective☆226Updated this week
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆186Updated last week
- ☆237Updated 3 weeks ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆370Updated last year
- Understand and test language model architectures on synthetic tasks.☆251Updated 2 weeks ago
- 🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.☆621Updated 3 weeks ago
- Dion optimizer algorithm☆420Updated 2 weeks ago
- Storing long contexts in tiny caches with self-study☆231Updated last month
- Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r…☆313Updated last month
- AIRA-dojo: a framework for developing and evaluating AI research agents☆124Updated last week
- Physics of Language Models: Part 4.2, Canon Layers at Scale where Synthetic Pretraining Resonates in Reality☆314Updated 3 weeks ago
- MoE training for Me and You and maybe other people☆331Updated 3 weeks ago
- Open-source release accompanying Gao et al. 2025☆498Updated last month
- Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike stat…☆418Updated last week
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆127Updated 3 months ago
- Official JAX implementation of End-to-End Test-Time Training for Long Context☆478Updated last week