test-time-training/e2e

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/test-time-training/e2e)

test-time-training / e2e

Official JAX implementation of End-to-End Test-Time Training for Long Context

☆542

Alternatives and similar repositories for e2e

Users that are interested in e2e are comparing it to the libraries listed below

Sorting:

SalesforceAIResearch / PretrainRL-pipeline
View on GitHub
An automated data pipeline scaling RL to pretraining levels
☆72Oct 11, 2025Updated 4 months ago
robert-lieck / RBN
View on GitHub
Recursive Bayesian Networks
☆11May 11, 2025Updated 9 months ago
a1600012888 / LaCT
View on GitHub
Code release for paper "Test-Time Training Done Right"
☆379Jan 5, 2026Updated last month
CSU-JPG / MIND
View on GitHub
The first open-domain closed-loop revisited benchmark for evaluating memory consistency and action control in world models.
☆41Feb 10, 2026Updated 3 weeks ago
HazyResearch / cartridges
View on GitHub
Storing long contexts in tiny caches with self-study
☆243Dec 5, 2025Updated 2 months ago
ByteDance-Seed / Stable-DiffCoder
View on GitHub
Stable-DiffCoder is a family of lightweight open-source code DLLMs(diffusion large language models) comprising base and instruct models, …
☆75Jan 23, 2026Updated last month
RUC-NLPIR / OmniGAIA
View on GitHub
OmniGAIA: Towards Native Omni-Modal AI Agents
☆46Updated this week
Dao-AILab / grouped-latent-attention
View on GitHub
☆134May 29, 2025Updated 9 months ago
QwenLM / ParScale
View on GitHub
Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling
☆472May 17, 2025Updated 9 months ago
aakaran / reasoning-with-sampling
View on GitHub
☆399Nov 7, 2025Updated 3 months ago
yifanzhang-pro / HLA
View on GitHub
Official Project Page for HLA: Higher-order Linear Attention (https://arxiv.org/abs/2510.27258)
☆45Jan 6, 2026Updated last month
hyp1231 / ICLR2023-OpenReviewData
View on GitHub
Crawl & visualize ICLR papers and reviews.
☆18Nov 5, 2022Updated 3 years ago
ant-research / long-context-modeling
View on GitHub
Research work aimed at addressing the problem of modeling infinite-length context
☆46Dec 18, 2025Updated 2 months ago
tokenbender / mHC-manifold-constrained-hyper-connections
View on GitHub
implementations and experimentation on mHC by deepseek - https://arxiv.org/abs/2512.24880
☆317Feb 17, 2026Updated 2 weeks ago
test-time-training / discover
View on GitHub
☆468Feb 22, 2026Updated last week
tilde-research / nsa-impl
View on GitHub
An efficient implementation of the NSA (Native Sparse Attention) kernel
☆129Jun 24, 2025Updated 8 months ago
yuezhouhu / residual-context-diffusion
View on GitHub
Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs.
☆54Feb 11, 2026Updated 3 weeks ago
PRIME-RL / RL-Compositionality
View on GitHub
FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones
☆64Jan 26, 2026Updated last month
moonquest-ai / SRDA
View on GitHub
☆30Jun 7, 2025Updated 8 months ago
fla-org / flash-linear-attention
View on GitHub
🚀 Efficient implementations of state-of-the-art linear attention models
☆4,428Updated this week
Zoeyyao27 / SirLLM
View on GitHub
This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM
☆60May 28, 2024Updated last year
nil0x9 / flash-muon
View on GitHub
Flash-Muon: An Efficient Implementation of Muon Optimizer
☆237Jun 15, 2025Updated 8 months ago
InternRobotics / MMSI-Video-Bench
View on GitHub
MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence
☆55Feb 10, 2026Updated 3 weeks ago
satrams / rent-rl
View on GitHub
RENT (Reinforcement Learning via Entropy Minimization) is an unsupervised method for training reasoning LLMs.
☆41Oct 31, 2025Updated 4 months ago
GAIR-NLP / AIME-Preview
View on GitHub
☆80Mar 11, 2025Updated 11 months ago
glassroom / heinsen_attention
View on GitHub
Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)
☆24Jun 6, 2024Updated last year
SakanaAI / self-adaptive-llms
View on GitHub
A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!
☆1,189Jan 30, 2025Updated last year
showlab / FocusUI
View on GitHub
[CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection
☆25Feb 10, 2026Updated 3 weeks ago
goombalab / Gather-and-Aggregate
View on GitHub
Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"
☆14Apr 30, 2025Updated 10 months ago
WinnieHAN / structure_adv
View on GitHub
☆10Oct 28, 2020Updated 5 years ago
JmlrOrg / dmlr-style-file
View on GitHub
☆12Nov 21, 2023Updated 2 years ago
Doraemonzzz / nanoTransNormer
View on GitHub
☆11Oct 11, 2023Updated 2 years ago
YS-IMTech / HyperDreamer
View on GitHub
(Siggraph Asia 2023) Project Page of "HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image"
☆10Dec 9, 2023Updated 2 years ago
da03 / criticize_text_generation
View on GitHub
A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …
☆11Mar 18, 2023Updated 2 years ago
ypwang61 / ThetaEvolve
View on GitHub
ThetaEvolve: Test-time Learning on Open Problems, enabling RL training on AlphaEvolve/OpenEvolve and emphasizing scaling test-time comput…
☆132Updated this week
kvfrans / jaxtransformer
View on GitHub
Minimal Transformer base in JAX. A single backbone for language modelling, diffusion, classification, etc...
☆14May 28, 2025Updated 9 months ago
DozerDB / genai-stack
View on GitHub
Langchain + Docker + Neo4j
☆10Oct 29, 2024Updated last year
feifeibear / ChituAttention
View on GitHub
Quantized Attention on GPU
☆44Nov 22, 2024Updated last year
brendanhogan / completion_tree_view
View on GitHub
☆15Apr 26, 2025Updated 10 months ago