Tony-Y / pytorch_warmup
Learning Rate Warmup in PyTorch
☆410Updated last month
Alternatives and similar repositories for pytorch_warmup:
Users that are interested in pytorch_warmup are comparing it to the libraries listed below
- Gradually-Warmup Learning Rate Scheduler for PyTorch☆989Updated 6 months ago
- Tiny PyTorch library for maintaining a moving average of a collection of parameters.☆428Updated 7 months ago
- ☆457Updated 2 years ago
- A PyTorch Implementation of Focal Loss.☆982Updated 5 years ago
- Pytorch implementation of the paper "Class-Balanced Loss Based on Effective Number of Samples"☆797Updated last year
- Compute CNN receptive field size in pytorch in one line☆360Updated 11 months ago
- 🛠 Toolbox to extend PyTorch functionalities☆419Updated 11 months ago
- A PyTorch implementation of the 1d and 2d Sinusoidal positional encoding/embedding.☆253Updated 4 years ago
- An (unofficial) implementation of Focal Loss, as described in the RetinaNet paper, generalized to the multi-class case.☆235Updated last year
- A simple way to keep track of an Exponential Moving Average (EMA) version of your Pytorch model☆577Updated 5 months ago
- Implementing Stand-Alone Self-Attention in Vision Models using Pytorch☆455Updated 5 years ago
- SAM: Sharpness-Aware Minimization (PyTorch)☆1,867Updated last year
- An All-MLP solution for Vision, from Google AI☆1,020Updated 7 months ago
- PyTorch implementation of Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning☆491Updated 2 years ago
- Ranger deep learning optimizer rewrite to use newest components☆329Updated last year
- NFNets and Adaptive Gradient Clipping for SGD implemented in PyTorch. Find explanation at tourdeml.github.io/blog/☆345Updated last year
- PyTorch implementation of SimCLR: A Simple Framework for Contrastive Learning of Visual Representations by T. Chen et al.☆787Updated 11 months ago
- Transformer based on a variant of attention that is linear complexity in respect to sequence length☆763Updated last year
- My best practice of training large dataset using PyTorch.☆1,097Updated 11 months ago
- Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch☆1,138Updated last year
- mixup: Beyond Empirical Risk Minimization☆1,178Updated 3 years ago
- Implementation of Transformer in Transformer, pixel level attention paired with patch level attention for image classification, in Pytorc…☆304Updated 3 years ago
- A learning rate range test implementation in PyTorch☆955Updated last week
- Implementation of 1D, 2D, and 3D FFT convolutions in PyTorch. Much faster than direct convolutions for large kernel sizes.☆494Updated last year
- A Pytorch-Lightning implementation of self-supervised algorithms☆538Updated 3 years ago
- Implementation of ConvMixer for "Patches Are All You Need? 🤷"☆1,071Updated 2 years ago
- Unofficial PyTorch implementation of "Meta Pseudo Labels"☆387Updated last year
- Implementation of DropBlock: A regularization method for convolutional networks in PyTorch.☆594Updated 4 years ago
- Official Pytorch Implementation of: "Asymmetric Loss For Multi-Label Classification"(ICCV, 2021) paper☆757Updated last year
- Ranger - a synergistic optimizer using RAdam (Rectified Adam), Gradient Centralization and LookAhead in one codebase☆1,201Updated last year