Mrpatekful / swats
Unofficial implementation of Switching from Adam to SGD optimization in PyTorch.
☆66Updated 2 years ago
Alternatives and similar repositories for swats
Users that are interested in swats are comparing it to the libraries listed below
- PyTorch implementation of Lookahead Optimizer☆195Updated 3 years ago
- lookahead optimizer (Lookahead Optimizer: k steps forward, 1 step back) for pytorch☆337Updated 6 years ago
- Implementation and experiments for AdamW on Pytorch☆94Updated 5 years ago
- Implements https://arxiv.org/abs/1711.05101 AdamW optimizer, cosine learning rate scheduler and "Cyclical Learning Rates for Training Neu…☆152Updated 6 years ago
- Pytorch implementation of the hamburger module from the ICLR 2021 paper "Is Attention Better Than Matrix Decomposition"☆99Updated 4 years ago
- Robust Bi-Tempered Logistic Loss Based on Bregman Divergences. https://arxiv.org/pdf/1906.03361.pdf☆147Updated 3 years ago
- This is my demo of Chen et al. "GradNorm: Gradient Normalization for Adaptive Loss Balancing in Deep Multitask Networks" ICML 2018☆181Updated 3 years ago
- [ICML 2020] code for the flooding regularizer proposed in "Do We Need Zero Training Loss After Achieving Zero Training Error?"☆93Updated 2 years ago
- Official PyTorch Repo for "ReZero is All You Need: Fast Convergence at Large Depth"☆415Updated last year
- Loss and accuracy go opposite ways...right?☆95Updated 5 years ago
- PyTorch implementation of the basic k-means algorithm (Lloyd's method with Forgy initialization) with GPU support☆94Updated 7 years ago
- AdaX: Adaptive Gradient Descent with Exponential Long Term Memory☆34Updated 5 years ago
- Apollo: An Adaptive Parameter-wise Diagonal Quasi-Newton Method for Nonconvex Stochastic Optimization☆182Updated 3 years ago
- ☆47Updated 4 years ago
- Mish Deep Learning Activation Function for PyTorch / FastAI☆161Updated 5 years ago
- Multi-Task Learning Framework on PyTorch. State-of-the-art methods are implemented to effectively train models on multiple tasks.☆149Updated 6 years ago
- Feature extraction made simple with torchextractor☆101Updated 4 years ago
- ☆165Updated 6 years ago
- [ICML 2020] code for "PowerNorm: Rethinking Batch Normalization in Transformers" https://arxiv.org/abs/2003.07845☆120Updated 4 years ago
- Official implementation of Auxiliary Learning by Implicit Differentiation [ICLR 2021]☆86Updated last year
- Pytorch implementation of the image transformer for unconditional image generation☆118Updated last year
- Decoupled Weight Decay Regularization (ICLR 2019)☆282Updated 6 years ago
- Useful PyTorch functions and modules that are not implemented in PyTorch by default☆188Updated last year
- ☆148Updated 3 years ago
- Independently Recurrent Neural Networks (IndRNN) implemented in pytorch.☆138Updated 4 years ago
- Framework for creating (partially) reversible neural networks with PyTorch☆154Updated 3 years ago
- PyTorch Examples repo for "ReZero is All You Need: Fast Convergence at Large Depth"☆62Updated last year
- ☆84Updated 4 years ago
- ☆261Updated 6 years ago
- A Pytorch implementation of "LegoNet: Efficient Convolutional Neural Networks with Lego Filters" (ICML 2019).☆140Updated 4 years ago