thinking-machines-lab / tinkerLinks
Training API and CLI
☆305Updated 3 weeks ago
Alternatives and similar repositories for tinker
Users that are interested in tinker are comparing it to the libraries listed below
Sorting:
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates.☆343Updated 2 weeks ago
- Open-source framework for the research and development of foundation models.☆700Updated this week
- PyTorch-native post-training at scale☆584Updated last week
- ☆945Updated 2 months ago
- Simple & Scalable Pretraining for Neural Architecture Research☆306Updated last month
- ☆686Updated this week
- FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference.☆329Updated 2 months ago
- A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).☆340Updated 3 weeks ago
- RLP: Reinforcement as a Pretraining Objective☆222Updated 3 months ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆366Updated last year
- A Gym for Agentic LLMs☆412Updated last week
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"☆341Updated 2 months ago
- ☆233Updated this week
- Physics of Language Models, Part 4☆291Updated this week
- rl from zero pretrain, can it be done? yes.☆286Updated 3 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆181Updated 6 months ago
- Curated collection of community environments☆200Updated this week
- Async RL Training at Scale☆985Updated this week
- Official JAX implementation of End-to-End Test-Time Training for Long Context☆214Updated last week
- Exploring Applications of GRPO☆251Updated 4 months ago
- 🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.☆609Updated this week
- This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"☆284Updated last month
- Understand and test language model architectures on synthetic tasks.☆248Updated 3 months ago
- A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning☆334Updated 2 months ago
- Dion optimizer algorithm☆413Updated this week
- OpenTinker is an RL-as-a-Service infrastructure for foundation models☆499Updated last week
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache☆137Updated 4 months ago
- Storing long contexts in tiny caches with self-study☆228Updated last month
- Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support☆232Updated this week
- Memory optimized Mixture of Experts☆72Updated 5 months ago