nikitadurasov / torch-ttt
A modular and easy-to-use framework for Test-Time Training (TTT) and Test-Time Adaptation (TTA) in PyTorch, making your networks more generalizable with minimal effort ✨
☆23 · Updated this week
Alternatives and similar repositories for torch-ttt
Users who are interested in torch-ttt are comparing it to the libraries listed below.
- High order and sparse layers in PyTorch. Lagrange Polynomial, Piecewise Lagrange Polynomial, Piecewise Discontinuous Lagrange Polynomial… ☆45 · Updated last year
- Clifford-Steerable Convolutional Neural Networks [ICML'24] ☆49 · Updated 3 months ago
- Bare-bones implementations of some generative models in JAX: diffusion, normalizing flows, consistency models, flow matching, (beta)-VAEs… ☆132 · Updated last year
- The AdEMAMix Optimizer: Better, Faster, Older. ☆184 · Updated 11 months ago
- Erwin: A Tree-based Hierarchical Transformer for Large-scale Physical Systems [ICML'25] ☆99 · Updated last month
- Discrete Bayesian optimization with LLMs, PEFT finetuning methods, and the Laplace approximation. ☆19 · Updated last year
- Diffusion models in PyTorch ☆107 · Updated last month
- Library to make any existing neural network architecture equivariant ☆54 · Updated 9 months ago
- PyTorch implementation of a simple way to enable (Stochastic) Frame Averaging for any network ☆50 · Updated last year
- ☆125 · Updated 8 months ago
- ☆207 · Updated 8 months ago
- Implicit Convolutional Kernels for Steerable CNNs [NeurIPS'23] ☆29 · Updated 5 months ago
- A tiny library for stochastic dataset caching in PyTorch. ☆43 · Updated last year
- A minimal implementation of Equivariant Neural Fields (https://arxiv.org/abs/2406.05753). ☆25 · Updated 6 months ago
- ☆42 · Updated last year
- Kolmogorov-Arnold networks (KAN) as implicit functions (like NeRF but simpler) ☆14 · Updated last year
- ☆16 · Updated 3 years ago
- A practical implementation of GradNorm, Gradient Normalization for Adaptive Loss Balancing, in PyTorch ☆100 · Updated last year
- Running JAX in PyTorch Lightning ☆111 · Updated 7 months ago
- Implementation of "Fourier Features Let Networks Learn High Frequency Functions in Low Dimensional Domains" by Tancik et al. ☆100 · Updated this week
- Transformers with doubly stochastic attention ☆46 · Updated 2 years ago
- This is a port of the Mistral-7B model in JAX ☆32 · Updated last year
- PyTorch implementation of the Levenberg-Marquardt training algorithm ☆73 · Updated 4 months ago
- Free-form flows are a generative model training a pair of neural networks via maximum likelihood ☆47 · Updated last month
- Exact method for visualizing partitions of a Deep Neural Network, CVPR 2023 Highlight ☆109 · Updated 6 months ago
- Universal Tensor Operations in Einstein-Inspired Notation for Python. ☆394 · Updated 4 months ago
- Code repository of the paper "Variational Stochastic Gradient Descent for Deep Neural Networks" published at ☆39 · Updated 3 months ago
- Fast, Expressive SE(n) Equivariant Networks through Weight-Sharing in Position-Orientation Space. ☆85 · Updated last year
- Explorations into whether a transformer with RL can direct a genetic algorithm to converge faster ☆70 · Updated 2 months ago
- Explorations into the recently proposed Taylor Series Linear Attention ☆100 · Updated 11 months ago