Mrpatekful / swats
Unofficial implementation of Switching from Adam to SGD optimization in PyTorch.
☆66Updated 2 years ago
Alternatives and similar repositories for swats
Users who are interested in swats are comparing it to the libraries listed below.
- PyTorch implementation of the Hamburger module from the ICLR 2021 paper "Is Attention Better Than Matrix Decomposition?"☆99Updated 4 years ago
- [ICML 2020] code for the flooding regularizer proposed in "Do We Need Zero Training Loss After Achieving Zero Training Error?"☆92Updated 2 years ago
- Implementation and experiments for AdamW in PyTorch☆94Updated 5 years ago
- Implements https://arxiv.org/abs/1711.05101 AdamW optimizer, cosine learning rate scheduler and "Cyclical Learning Rates for Training Neu…☆149Updated 6 years ago
- PyTorch implementation of the Lookahead optimizer☆191Updated 3 years ago
- PyTorch implementation of the basic k-means algorithm (Lloyd's method with Forgy initialization) with GPU support☆93Updated 7 years ago
- Demo of Chen et al., "GradNorm: Gradient Normalization for Adaptive Loss Balancing in Deep Multitask Networks", ICML 2018☆178Updated 3 years ago
- Lookahead optimizer ("Lookahead Optimizer: k steps forward, 1 step back") for PyTorch☆337Updated 5 years ago
- Apollo: An Adaptive Parameter-wise Diagonal Quasi-Newton Method for Nonconvex Stochastic Optimization☆183Updated 3 years ago
- Multi-Task Learning Framework on PyTorch. State-of-the-art methods are implemented to effectively train models on multiple tasks.☆149Updated 6 years ago
- AdaX: Adaptive Gradient Descent with Exponential Long Term Memory☆34Updated 5 years ago
- Official implementation of Auxiliary Learning by Implicit Differentiation [ICLR 2021]☆84Updated 11 months ago
- Loss and accuracy go opposite ways...right?☆94Updated 5 years ago
- ☆47Updated 4 years ago
- Reproducing the Linear Multihead Attention introduced in Linformer paper (Linformer: Self-Attention with Linear Complexity)☆76Updated 5 years ago
- PyTorch implementation of Learning Rate Dropout.☆42Updated 5 years ago
- Implementation of Online Label Smoothing in PyTorch☆94Updated 2 years ago
- Mish Deep Learning Activation Function for PyTorch / FastAI☆161Updated 5 years ago
- [ICML 2020] code for "PowerNorm: Rethinking Batch Normalization in Transformers" https://arxiv.org/abs/2003.07845☆120Updated 4 years ago
- Unsupervised Data Augmentation experiments in PyTorch☆60Updated 5 years ago
- PyTorch Examples repo for "ReZero is All You Need: Fast Convergence at Large Depth"☆61Updated 11 months ago
- ☆147Updated 3 years ago
- Bootstrapping loss function implementation in PyTorch☆36Updated 4 years ago
- Useful PyTorch functions and modules that are not implemented in PyTorch by default☆188Updated last year
- Implementation for our WACV 2021 paper "Multi-Loss Weighting with Coefficient of Variations"☆50Updated 4 years ago
- Official PyTorch Repo for "ReZero is All You Need: Fast Convergence at Large Depth"☆409Updated 11 months ago
- Feature extraction made simple with torchextractor☆101Updated 4 years ago
- EnAET: Self-Trained Ensemble AutoEncoding Transformations for Semi-Supervised Learning☆69Updated 2 years ago
- Unofficial PyTorch Implementation of EvoNorm☆122Updated 3 years ago
- A PyTorch implementation of "LegoNet: Efficient Convolutional Neural Networks with Lego Filters" (ICML 2019).☆140Updated 4 years ago