thinking-machines-lab / tinker

Training API and CLI

☆238

Alternatives and similar repositories for tinker

Users who are interested in tinker are comparing it to the libraries listed below.

ServiceNow / PipelineRL
A scalable asynchronous reinforcement learning implementation with in-flight weight updates.
☆306 Updated last week
ekinakyurek / marc
Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"
☆338 Updated 2 weeks ago
thinking-machines-lab / batch_invariant_ops
☆912 Updated 3 weeks ago
microsoft / ArchScale
Simple & Scalable Pretraining for Neural Architecture Research
☆302 Updated last month
EleutherAI / nanoGPT-mup
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆173 Updated 5 months ago
tokenbender / avataRL
rl from zero pretrain, can it be done? yes.
☆281 Updated 2 months ago
LeonGuertler / UnstableBaselines
☆106 Updated last month
OSU-NLP-Group / GrokkedTransformer
Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'
☆234 Updated 4 months ago
marin-community / marin
Open-source framework for the research and development of foundation models.
☆629 Updated last week
HazyResearch / cartridges
Storing long contexts in tiny caches with self-study
☆217 Updated last month
axon-rl / gem
A Gym for Agentic LLMs
☆364 Updated 2 weeks ago
pyember / ember
☆233 Updated 5 months ago
google-deepmind / regress-lm
Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r…
☆294 Updated this week
changjonathanc / flex-nano-vllm
FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference.
☆305 Updated 3 weeks ago
meta-pytorch / torchforge
PyTorch-native post-training at scale
☆546 Updated last week
athms / mad-lab
A MAD laboratory to improve AI architecture designs 🧪
☆135 Updated 11 months ago
m-a-n-i-f-e-s-t / power-attention
Attention Kernels for Symmetric Power Transformers
☆128 Updated 2 months ago
PrimeIntellect-ai / prime-environments
Training-Ready RL Environments + Evals
☆177 Updated last week
iliao2345 / CompressARC
☆201 Updated 3 months ago
magicproduct / hash-hop
Long context evaluation for large language models
☆224 Updated 8 months ago
facebookresearch / RAM
A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).
☆298 Updated 2 weeks ago
arcprize / hierarchical-reasoning-model-analysis
☆157 Updated 3 months ago
data-for-agents / insta
Official Repo for InSTA: Towards Internet-Scale Training For Agents
☆56 Updated 4 months ago
PrimeIntellect-ai / prime-rl
Async RL Training at Scale
☆780 Updated last week
facebookresearch / memory
Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…
☆358 Updated 11 months ago
arcprize / ARC-AGI-3-Agents
☆96 Updated last month
wolfecameron / nanoMoE
An extension of the nanoGPT repository for training small MOE models.
☆215 Updated 8 months ago
HazyResearch / zoology
Understand and test language model architectures on synthetic tasks.
☆240 Updated 2 months ago
goodfire-ai / r1-interpretability
Open source interpretability artefacts for R1.
☆163 Updated 7 months ago
Zyphra / tree_attention
Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters
☆130 Updated 11 months ago