thinking-machines-lab / tinker
Training API and CLI
☆266 Updated this week
Alternatives and similar repositories for tinker
Users who are interested in tinker are comparing it to the libraries listed below.
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates. ☆329 Updated last week
- PyTorch-native post-training at scale ☆566 Updated this week
- ☆937 Updated last month
- Training-Ready RL Environments + Evals ☆190 Updated last week
- Simple & Scalable Pretraining for Neural Architecture Research ☆305 Updated 2 weeks ago
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning" ☆340 Updated last month
- ☆208 Updated 4 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆179 Updated 5 months ago
- ☆115 Updated last week
- Understand and test language model architectures on synthetic tasks. ☆246 Updated 2 months ago
- ☆234 Updated 5 months ago
- ☆162 Updated 4 months ago
- MoE training for Me and You and maybe other people ☆239 Updated this week
- A MAD laboratory to improve AI architecture designs 🧪 ☆135 Updated last year
- FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference. ☆321 Updated last month
- A Gym for Agentic LLMs ☆404 Updated last month
- Open source interpretability artefacts for R1. ☆165 Updated 7 months ago
- A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning ☆328 Updated last month
- Physics of Language Models, Part 4 ☆270 Updated last week
- Open-source framework for the research and development of foundation models. ☆658 Updated this week
- Async RL Training at Scale ☆938 Updated last week
- Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r… ☆301 Updated this week
- RLP: Reinforcement as a Pretraining Objective ☆213 Updated 2 months ago
- rl from zero pretrain, can it be done? yes. ☆282 Updated 2 months ago
- Normalized Transformer (nGPT) ☆193 Updated last year
- Storing long contexts in tiny caches with self-study ☆220 Updated 2 weeks ago
- Ideas for projects related to Tinker ☆129 Updated last month
- 🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc. ☆582 Updated last month
- A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM). ☆331 Updated last week
- This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning" ☆277 Updated 3 weeks ago