thinking-machines-lab / tinkerLinks
Training API and CLI
☆325Updated last week
Alternatives and similar repositories for tinker
Users that are interested in tinker are comparing it to the libraries listed below
Sorting:
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates.☆361Updated this week
- PyTorch-native post-training at scale☆613Updated this week
- ☆961Updated 3 months ago
- ☆394Updated last week
- FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference.☆334Updated 3 months ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆371Updated last year
- Open-source framework for the research and development of foundation models.☆752Updated this week
- ☆237Updated last month
- Async RL Training at Scale☆1,044Updated this week
- Lightly-reviewed collection of community environments☆210Updated 2 weeks ago
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"☆344Updated 3 months ago
- rl from zero pretrain, can it be done? yes.☆286Updated 4 months ago
- Official JAX implementation of End-to-End Test-Time Training for Long Context☆520Updated 2 weeks ago
- Storing long contexts in tiny caches with self-study☆236Updated 2 months ago
- This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"☆292Updated 2 months ago
- ☆232Updated 2 months ago
- A Gym for Agentic LLMs☆444Updated 3 weeks ago
- Simple & Scalable Pretraining for Neural Architecture Research☆307Updated 2 months ago
- A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).☆344Updated last month
- MoE training for Me and You and maybe other people☆335Updated last month
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆186Updated 3 weeks ago
- [ICLR 2026] Official PyTorch Implementation of RLP: Reinforcement as a Pretraining Objective☆231Updated 2 weeks ago
- Normalized Transformer (nGPT)☆198Updated last year
- Dion optimizer algorithm☆431Updated 3 weeks ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆128Updated 4 months ago
- Open-source release accompanying Gao et al. 2025☆501Updated last month
- Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.☆830Updated last week
- Exploring Applications of GRPO☆251Updated 5 months ago
- Ideas for projects related to Tinker☆164Updated 3 months ago
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache☆140Updated 5 months ago