test-time-training/ttt-lm-kernels

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/test-time-training/ttt-lm-kernels)

test-time-training / ttt-lm-kernels

Inference Speed Benchmark for Learning to (Learn at Test Time): RNNs with Expressive Hidden States

☆92

Alternatives and similar repositories for ttt-lm-kernels

Users that are interested in ttt-lm-kernels are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

test-time-training / ttt-lm-jax
View on GitHub
Official JAX implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
☆461Nov 2, 2025Updated 8 months ago
test-time-training / ttt-lm-pytorch
View on GitHub
Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
☆1,384Jul 14, 2024Updated 2 years ago
AlirezaMorsali / MLP-Attention
View on GitHub
☆17Dec 19, 2024Updated last year
SmerkyG / GoldFinch-paper
View on GitHub
GoldFinch and other hybrid transformer components
☆16Dec 9, 2025Updated 7 months ago
tinkoff-ai / cnf
View on GitHub
Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…
☆12Jan 31, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
test-time-training / mttt
View on GitHub
Official JAX implementation of Learning to (Learn at Test Time)
☆92Nov 16, 2023Updated 2 years ago
OpenNLPLab / HGRN2
View on GitHub
HGRN2: Gated Linear RNNs with State Expansion
☆58Aug 20, 2024Updated last year
test-time-training / ttt-tk
View on GitHub
☆45Nov 1, 2025Updated 8 months ago
UbiquitousLearning / NNV12
View on GitHub
☆13May 11, 2023Updated 3 years ago
berlino / gated_linear_attention
View on GitHub
☆107Mar 9, 2024Updated 2 years ago
test-time-training / e2e
View on GitHub
Official JAX implementation of End-to-End Test-Time Training for Long Context
☆627Feb 15, 2026Updated 5 months ago
alxmamaev / ultimate_tts
View on GitHub
☆13Aug 7, 2021Updated 4 years ago
jina-ai / textbook
View on GitHub
distill chatGPT coding ability into small model (1b)
☆31Sep 7, 2023Updated 2 years ago
sekstini / gpupoor
View on GitHub
☆18Dec 2, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
corl-team / lime
View on GitHub
Official implementation of the paper "You Do Not Fully Utilize Transformer's Representation Capacity"
☆32May 28, 2025Updated last year
EugenHotaj / llm_parallelisms.c
View on GitHub
LLM training parallelisms (DP, FSDP, TP, PP) in pure C
☆29Jan 27, 2026Updated 6 months ago
ammar-n-abbas / Predictive-Maintenance-BC-IOHMM-DRL
View on GitHub
Hierarchical Framework for Interpretable Deep Reinforcement Learning Based- Predictive Maintenance (Applied to NASA Turbofan engine datas…
☆14Feb 9, 2024Updated 2 years ago
zhixuan-lin / forgetting-transformer
View on GitHub
[ICLR 2025 & COLM 2025] Official PyTorch implementation of the Forgetting Transformer and Adaptive Computation Pruning
☆150Feb 25, 2026Updated 5 months ago
recursal / RADLADS-paper
View on GitHub
RADLADS training code
☆46May 7, 2025Updated last year
berlino / seq_icl
View on GitHub
☆54May 20, 2024Updated 2 years ago
sri9s / tinystories-language-models
View on GitHub
Exploring the minimal architecture required for coherent English language generation.
☆16Jun 11, 2026Updated last month
OpenDroneMap / odm_orthophoto
View on GitHub
ODM Orthophoto Module - Rasterize Textured 3D Models to PNG
☆14Feb 25, 2025Updated last year
aaberdam / AdaLISTA
View on GitHub
Ada-LISTA: Learned Solvers Adaptive to Varying Models
☆11Feb 18, 2020Updated 6 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
yurakuratov / hidden_capacity
View on GitHub
Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity (ACL 2025, oral)
☆35Jun 14, 2025Updated last year
socialfoundations / tttlm
View on GitHub
Test-time-training on nearest neighbors for large language models
☆50Apr 18, 2024Updated 2 years ago
sigaloid / mutter
View on GitHub
Easy-to-use Rust bindings to the Whisper.cpp machine learning transcription library!
☆26Oct 31, 2025Updated 8 months ago
PraveenRaja42 / Tiny-Stories-GPT
View on GitHub
A minimal PyTorch re-implementation of GPT (Generative Pretrained Transformer) language model training
☆19Sep 15, 2023Updated 2 years ago
thunlp / EREN
View on GitHub
Official codes for COLING 2024 paper "Robust and Scalable Model Editing for Large Language Models": https://arxiv.org/abs/2403.17431v1
☆14Mar 27, 2024Updated 2 years ago
zhouhanxie / PRAG
View on GitHub
☆12May 13, 2023Updated 3 years ago
mtanveer1 / AVSEC-3-Challenge
View on GitHub
Audio-Visual Speech Enhancement Challenge (AVSE) 2024
☆12Feb 6, 2026Updated 5 months ago
tinkoff-ai / lb-sac
View on GitHub
Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Worksho…
☆21Feb 27, 2023Updated 3 years ago
gau-nernst / kokoro
View on GitHub
https://hf.co/hexgrad/Kokoro-82M
☆14Jan 14, 2026Updated 6 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
athms / mad-lab
View on GitHub
A MAD laboratory to improve AI architecture designs 🧪
☆147Dec 17, 2024Updated last year
meulemansalex / theoretical_framework_for_target_propagation
View on GitHub
Python implementation of the methods in Meulemans et al. 2020 - A Theoretical Framework For Target Propagation
☆31Oct 31, 2024Updated last year
sepermetric / seper
View on GitHub
SePer is an accurate / fast / free-of-API metric to measure document quality via information gain
☆33Feb 22, 2026Updated 5 months ago
Trainy-ai / pluto
View on GitHub
Next Generation Experimental Tracking for Machine Learning Operations
☆17Updated this week
smonsays / hypernetwork-attention
View on GitHub
Official code for the paper "Attention as a Hypernetwork"
☆58Feb 24, 2026Updated 5 months ago
google-deepmind / exedec
View on GitHub
☆14May 9, 2024Updated 2 years ago
choidami / inductive-oocr
View on GitHub
☆16Mar 22, 2025Updated last year