katsura-jp / pytorch-cosine-annealing-with-warmup
☆440Updated last year
Related projects ⓘ
Alternatives and complementary repositories for pytorch-cosine-annealing-with-warmup
- Tiny PyTorch library for maintaining a moving average of a collection of parameters.☆406Updated last month
- Learning Rate Warmup in PyTorch☆392Updated this week
- (ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"☆806Updated 2 years ago
- Gradually-Warmup Learning Rate Scheduler for PyTorch☆977Updated last month
- AdamP: Slowing Down the Slowdown for Momentum Optimizers on Scale-invariant Weights (ICLR 2021)☆415Updated 3 years ago
- NFNets and Adaptive Gradient Clipping for SGD implemented in PyTorch. Find explanation at tourdeml.github.io/blog/☆345Updated 10 months ago
- Escaping the Big Data Paradigm with Compact Transformers, 2021 (Train your Vision Transformers in 30 mins on CIFAR-10 with a single GPU!)☆500Updated 2 weeks ago
- Implementation of ConvMixer for "Patches Are All You Need? 🤷"☆1,062Updated 2 years ago
- Masked Siamese Networks for Label-Efficient Learning (https://arxiv.org/abs/2204.07141)☆449Updated 2 years ago
- A simple way to keep track of an Exponential Moving Average (EMA) version of your Pytorch model☆517Updated last month
- A PyTorch implementation of "CoAtNet: Marrying Convolution and Attention for All Data Sizes"☆369Updated 3 years ago
- A LARS implementation in PyTorch☆335Updated 4 years ago
- Unofficial PyTorch implementation of "Meta Pseudo Labels"☆383Updated 10 months ago
- Image Test Time Augmentation with PyTorch!☆978Updated last year
- An All-MLP solution for Vision, from Google AI☆1,003Updated 2 months ago
- SAM: Sharpness-Aware Minimization (PyTorch)☆1,770Updated 9 months ago
- Deep Learning project template for PyTorch (multi-gpu training is supported)☆134Updated last year
- This is an official implementation for "Self-Supervised Learning with Swin Transformers".☆627Updated 3 years ago
- EsViT: Efficient self-supervised Vision Transformers☆408Updated last year
- Pre-trained NFNets with 99% of the accuracy of the official paper "High-Performance Large-Scale Image Recognition Without Normalization".☆159Updated 3 years ago
- 🛠 Toolbox to extend PyTorch functionalities☆417Updated 6 months ago
- Unofficial PyTorch Reimplementation of RandAugment.☆628Updated last year
- An (unofficial) implementation of Focal Loss, as described in the RetinaNet paper, generalized to the multi-class case.☆225Updated 9 months ago
- Implementation of Linformer for Pytorch☆257Updated 10 months ago
- Ranger deep learning optimizer rewrite to use newest components☆323Updated 9 months ago
- A PyTorch implementation of Sharpness-Aware Minimization for Efficiently Improving Generalization☆134Updated 3 years ago
- Explainability for Vision Transformers☆853Updated 2 years ago
- A Pytorch-Lightning implementation of self-supervised algorithms☆536Updated 2 years ago
- PyTorch implementation of SimSiam https//arxiv.org/abs/2011.10566☆1,162Updated last year
- An implementation of 1D, 2D, and 3D positional encoding in Pytorch and TensorFlow☆546Updated 3 weeks ago