Tony-Y / pytorch_warmup
Learning Rate Warmup in PyTorch
☆403Updated 2 weeks ago
Alternatives and similar repositories for pytorch_warmup:
Users that are interested in pytorch_warmup are comparing it to the libraries listed below
- Gradually-Warmup Learning Rate Scheduler for PyTorch☆986Updated 4 months ago
- ☆450Updated last year
- Tiny PyTorch library for maintaining a moving average of a collection of parameters.☆421Updated 4 months ago
- 🛠 Toolbox to extend PyTorch functionalities☆419Updated 9 months ago
- An All-MLP solution for Vision, from Google AI☆1,013Updated 5 months ago
- Source code for "On the Relationship between Self-Attention and Convolutional Layers"☆1,096Updated 2 years ago
- Implementation of ConvMixer for "Patches Are All You Need? 🤷"☆1,064Updated 2 years ago
- An (unofficial) implementation of Focal Loss, as described in the RetinaNet paper, generalized to the multi-class case.☆230Updated last year
- Implementation of Axial attention - attending to multi-dimensional data efficiently☆371Updated 3 years ago
- A PyTorch Implementation of Focal Loss.☆974Updated 5 years ago
- Unofficial implementation of MLP-Mixer: An all-MLP Architecture for Vision☆217Updated 3 years ago
- Pytorch implementation of the paper "Class-Balanced Loss Based on Effective Number of Samples"☆790Updated last year
- Ranger deep learning optimizer rewrite to use newest components☆327Updated last year
- Implementation of Linformer for Pytorch☆266Updated last year
- This is an official implementation for "Self-Supervised Learning with Swin Transformers".☆640Updated 3 years ago
- Implementing Stand-Alone Self-Attention in Vision Models using Pytorch☆455Updated 5 years ago
- Transformer based on a variant of attention that is linear complexity in respect to sequence length☆738Updated 9 months ago
- Unofficial PyTorch Reimplementation of RandAugment.☆631Updated last year
- A Pytorch-Lightning implementation of self-supervised algorithms☆537Updated 2 years ago
- SAM: Sharpness-Aware Minimization (PyTorch)☆1,830Updated 11 months ago
- Ranger - a synergistic optimizer using RAdam (Rectified Adam), Gradient Centralization and LookAhead in one codebase☆1,196Updated last year
- Official Pytorch Implementation of: "Asymmetric Loss For Multi-Label Classification"(ICCV, 2021) paper☆740Updated last year
- A learning rate range test implementation in PyTorch☆945Updated 2 months ago
- Compute CNN receptive field size in pytorch in one line☆357Updated 9 months ago
- Code snippets created for the PyTorch discussion board☆557Updated 4 years ago
- NFNets and Adaptive Gradient Clipping for SGD implemented in PyTorch. Find explanation at tourdeml.github.io/blog/☆345Updated last year
- PyTorch implementation of Contrastive Learning methods☆1,960Updated last year
- Image Test Time Augmentation with PyTorch!☆992Updated last year
- (ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"☆809Updated 2 years ago
- Code for the Convolutional Vision Transformer (ConViT)☆466Updated 3 years ago