nikitadurasov / torch-tttLinks
A modular and easy-to-use framework for Test-Time Training (TTT) and Test-Time Adaptation (TTA) in Pytorch, making your networks more generalizable with minimal effort ✨
☆22Updated 3 weeks ago
Alternatives and similar repositories for torch-ttt
Users that are interested in torch-ttt are comparing it to the libraries listed below
Sorting:
- Bare-bones implementations of some generative models in Jax: diffusion, normalizing flows, consistency models, flow matching, (beta)-VAEs…☆131Updated last year
- Clifford-Steerable Convolutional Neural Networks [ICML'24]☆48Updated 2 months ago
- High order and sparse layers in pytorch. Lagrange Polynomial, Piecewise Lagrange Polynomial, Piecewise Discontinuous Lagrange Polynomial…☆45Updated last year
- Diffusion models in PyTorch☆107Updated 3 weeks ago
- ☆121Updated 7 months ago
- A practical implementation of GradNorm, Gradient Normalization for Adaptive Loss Balancing, in Pytorch☆99Updated last year
- ☆23Updated 7 months ago
- Erwin: A Tree-based Hierarchical Transformer for Large-scale Physical Systems [ICML'25]☆82Updated 3 weeks ago
- Implicit Convolutional Kernels for Steerable CNNs [NeurIPS'23]☆28Updated 4 months ago
- Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch☆110Updated 7 months ago
- Pytorch implementation of a simple way to enable (Stochastic) Frame Averaging for any network☆50Updated 11 months ago
- ☆200Updated 7 months ago
- Explorations into the recently proposed Taylor Series Linear Attention☆99Updated 11 months ago
- The AdEMAMix Optimizer: Better, Faster, Older.☆183Updated 10 months ago
- A tiny library for stochastic dataset caching in PyTorch.☆43Updated last year
- Unofficial implementation of GotenNet, new SOTA 3d equivariant transformer, in Pytorch☆63Updated 3 months ago
- Implementation of papers in 101 lines of code.☆18Updated last year
- Running Jax in PyTorch Lightning☆106Updated 7 months ago
- Use Jax functions in Pytorch☆244Updated 2 years ago
- Flow-matching algorithms in JAX☆97Updated 11 months ago
- ☆150Updated 11 months ago
- This is a port of Mistral-7B model in JAX☆32Updated last year
- code for "Adjoint Sampling: Highly Scalable Diffusion Samplers via Adjoint Matching"☆108Updated 2 months ago
- Attempt to make multiple residual streams from Bytedance's Hyper-Connections paper accessible to the public☆87Updated last month
- Deep Generative Models course, AIMasters, 2022☆46Updated 2 years ago
- Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction"☆58Updated last year
- A minimal implementation of Equivariant Neural Fields (https://arxiv.org/abs/2406.05753).☆25Updated 5 months ago
- Code of the paper "Listening to the Noise: Blind Denoising with Gibbs Diffusion"☆33Updated last year
- ☆10Updated 3 years ago
- Run PyTorch in JAX. 🤝☆256Updated 2 weeks ago