fadel / pytorch_ema

Tiny PyTorch library for maintaining a moving average of a collection of parameters.
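
As a rough illustration of what maintaining a moving average of parameters means, here is a minimal sketch in plain Python. It is not the pytorch_ema API: the class name `SimpleEMA`, its `update` method, and the `decay` argument are hypothetical names chosen for this example, and plain floats stand in for tensors. It assumes the common exponential form, shadow = decay * shadow + (1 - decay) * parameter.

```python
from typing import List


class SimpleEMA:
    """Hypothetical helper (not the pytorch_ema API).

    Keeps a "shadow" copy of the parameters and moves it toward the
    current values with: shadow = decay * shadow + (1 - decay) * param.
    """

    def __init__(self, params: List[float], decay: float) -> None:
        if not 0.0 <= decay <= 1.0:
            raise ValueError("decay must be in [0, 1]")
        self.decay = decay
        # Copy the values so later changes to `params` do not alter the shadow.
        self.shadow = list(params)

    def update(self, params: List[float]) -> None:
        """Blend the current parameter values into the shadow copy."""
        d = self.decay
        self.shadow = [d * s + (1.0 - d) * p for s, p in zip(self.shadow, params)]


# Usage: plain floats stand in for model parameters.
weights = [0.0, 1.0]
ema = SimpleEMA(weights, decay=0.9)

weights = [1.0, 3.0]  # pretend an optimizer step changed the weights
ema.update(weights)   # the shadow moves 10% of the way toward the new values
```

A decay close to 1 makes the averaged copy change slowly, which is why averaged weights are often used for evaluation rather than training.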

☆439

Alternatives and similar repositories for pytorch_ema

Users who are interested in pytorch_ema are comparing it to the libraries listed below.

Tony-Y / pytorch_warmup
Learning Rate Warmup in PyTorch
☆414 Updated 5 months ago
katsura-jp / pytorch-cosine-annealing-with-warmup
☆466 Updated 2 years ago
lucidrains / ema-pytorch
A simple way to keep track of an Exponential Moving Average (EMA) version of your Pytorch model
☆624 Updated 11 months ago
locuslab / convmixer
Implementation of ConvMixer for "Patches Are All You Need? 🤷"
☆1,078 Updated 3 years ago
kakaobrain / torchlars
A LARS implementation in PyTorch
☆353 Updated 5 years ago
wzlxjtu / PositionalEncoding2D
A PyTorch implementation of the 1d and 2d Sinusoidal positional encoding/embedding.
☆260 Updated 5 years ago
xxxnell / how-do-vits-work
(ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"
☆820 Updated 3 years ago
ildoonet / pytorch-gradual-warmup-lr
Gradually-Warmup Learning Rate Scheduler for PyTorch
☆993 Updated last year
lucidrains / mlp-mixer-pytorch
An All-MLP solution for Vision, from Google AI
☆1,053 Updated 4 months ago
lucidrains / transformer-in-transformer
Implementation of Transformer in Transformer, pixel level attention paired with patch level attention for image classification, in Pytorc…
☆309 Updated 3 years ago
google-research / sam
☆611 Updated 3 months ago
DeMoriarty / fast_pytorch_kmeans
This is a pytorch implementation of k-means clustering algorithm
☆333 Updated 8 months ago
imbue-ai / self_supervised
A Pytorch-Lightning implementation of self-supervised algorithms
☆545 Updated 3 years ago
fkodom / fft-conv-pytorch
Implementation of 1D, 2D, and 3D FFT convolutions in PyTorch. Much faster than direct convolutions for large kernel sizes.
☆513 Updated 2 years ago
sail-sg / Adan
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
☆803 Updated 5 months ago
lucidrains / linformer
Implementation of Linformer for Pytorch
☆302 Updated last year
vballoli / nfnets-pytorch
NFNets and Adaptive Gradient Clipping for SGD implemented in PyTorch. Find explanation at tourdeml.github.io/blog/
☆349 Updated last year
lucidrains / axial-attention
Implementation of Axial attention - attending to multi-dimensional data efficiently
☆391 Updated 4 years ago
nocotan / pytorch-lightning-gans
Collection of PyTorch Lightning implementations of Generative Adversarial Network varieties presented in research papers.
☆170 Updated 8 months ago
lucidrains / perceiver-pytorch
Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch
☆1,181 Updated 2 years ago
lukemelas / do-you-even-need-attention
Is the attention layer even necessary? (https://arxiv.org/abs/2105.02723)
☆483 Updated 4 years ago
AdeelH / pytorch-multi-class-focal-loss
An (unofficial) implementation of Focal Loss, as described in the RetinaNet paper, generalized to the multi-class case.
☆239 Updated last year
assafshocher / ResizeRight
The correct way to resize images or tensors. For Numpy or Pytorch (differentiable).
☆566 Updated 2 years ago
microsoft / esvit
EsViT: Efficient self-supervised Vision Transformers
☆411 Updated 2 years ago
rishikksh20 / MLP-Mixer-pytorch
Unofficial implementation of MLP-Mixer: An all-MLP Architecture for Vision
☆217 Updated 4 years ago
lucidrains / linear-attention-transformer
Transformer based on a variant of attention that is linear complexity in respect to sequence length
☆814 Updated last year
facebookresearch / msn
Masked Siamese Networks for Label-Efficient Learning (https://arxiv.org/abs/2204.07141)
☆463 Updated 3 years ago
SHI-Labs / Compact-Transformers
Escaping the Big Data Paradigm with Compact Transformers, 2021 (Train your Vision Transformers in 30 mins on CIFAR-10 with a single GPU!)
☆537 Updated last year
lessw2020 / Ranger21
Ranger deep learning optimizer rewrite to use newest components
☆338 Updated last year
cmsflash / efficient-attention
An implementation of the efficient attention module.
☆326 Updated 5 years ago