Mrpatekful / swats

Unofficial implementation of Switching from Adam to SGD optimization in PyTorch.

☆66

Alternatives and similar repositories for swats

Users who are interested in swats are comparing it to the libraries listed below.

lucidrains / hamburger-pytorch
PyTorch implementation of the hamburger module from the ICLR 2021 paper "Is Attention Better Than Matrix Decomposition?"
☆99Updated 4 years ago
takashiishida / flooding
[ICML 2020] code for the flooding regularizer proposed in "Do We Need Zero Training Loss After Achieving Zero Training Error?"
☆92Updated 2 years ago
egg-west / AdamW-pytorch
Implementation and experiments for AdamW in PyTorch
☆94Updated 5 years ago
mpyrozhok / adamwr
Implements https://arxiv.org/abs/1711.05101 AdamW optimizer, cosine learning rate scheduler and "Cyclical Learning Rates for Training Neu…
☆149Updated 6 years ago
lonePatient / lookahead_pytorch
PyTorch implementation of the Lookahead optimizer
☆191Updated 3 years ago
overshiki / kmeans_pytorch
PyTorch implementation of the basic k-means algorithm (Lloyd's method with Forgy initialization) with GPU support
☆93Updated 7 years ago
hosseinshn / GradNorm
This is my demo of Chen et al. "GradNorm: Gradient Normalization for Adaptive Loss Balancing in Deep Multitask Networks" ICML 2018
☆178Updated 3 years ago
alphadl / lookahead.pytorch
Lookahead optimizer ("Lookahead Optimizer: k steps forward, 1 step back") for PyTorch
☆337Updated 5 years ago
XuezheMax / apollo
Apollo: An Adaptive Parameter-wise Diagonal Quasi-Newton Method for Nonconvex Stochastic Optimization
☆183Updated 3 years ago
hav4ik / Hydra
Multi-Task Learning Framework on PyTorch. State-of-the-art methods are implemented to effectively train models on multiple tasks.
☆149Updated 6 years ago
switchablenorms / AdaX
AdaX: Adaptive Gradient Descent with Exponential Long Term Memory
☆34Updated 5 years ago
AvivNavon / AuxiLearn
Official implementation of Auxiliary Learning by Implicit Differentiation [ICLR 2021]
☆84Updated 11 months ago
thegregyang / LossUpAccUp
Loss and accuracy go opposite ways...right?
☆94Updated 5 years ago
izmailovpavel / torch_swa_examples
☆47Updated 4 years ago
kuixu / Linear-Multihead-Attention
Reproducing the Linear Multihead Attention introduced in Linformer paper (Linformer: Self-Attention with Linear Complexity)
☆76Updated 5 years ago
HuangxingLin123 / Learning-Rate-Dropout
PyTorch implementation of Learning Rate Dropout.
☆42Updated 5 years ago
ankandrew / online-label-smoothing-pt
Implementation of Online Label Smoothing in PyTorch
☆94Updated 2 years ago
lessw2020 / mish
Mish Deep Learning Activation Function for PyTorch / FastAI
☆161Updated 5 years ago
sIncerass / powernorm
[ICML 2020] code for "PowerNorm: Rethinking Batch Normalization in Transformers" https://arxiv.org/abs/2003.07845
☆120Updated 4 years ago
vfdev-5 / UDA-pytorch
Unsupervised Data Augmentation experiments in PyTorch
☆60Updated 5 years ago
tbachlechner / ReZero-examples
PyTorch Examples repo for "ReZero is All You Need: Fast Convergence at Large Depth"
☆61Updated 11 months ago
Hui-Li / multi-task-learning-example-PyTorch
☆147Updated 3 years ago
vfdev-5 / BootstrappingLoss
Bootstrapping loss function implementation in PyTorch
☆36Updated 4 years ago
pabloppp / pytorch-tools
Useful PyTorch functions and modules that are not implemented in PyTorch by default
☆188Updated last year
rickgroen / cov-weighting
Implementation for our WACV 2021 paper "Multi-Loss Weighting with Coefficient of Variations"
☆50Updated 4 years ago
majumderb / rezero
Official PyTorch Repo for "ReZero is All You Need: Fast Convergence at Large Depth"
☆409Updated 11 months ago
antoinebrl / torchextractor
Feature extraction made simple with torchextractor
☆101Updated 4 years ago
wang3702 / EnAET
EnAET: Self-Trained Ensemble AutoEncoding Transformations for Semi-Supervised Learning
☆69Updated 2 years ago
digantamisra98 / EvoNorm
Unofficial PyTorch Implementation of EvoNorm
☆122Updated 3 years ago
zhaohui-yang / LegoNet
A PyTorch implementation of "LegoNet: Efficient Convolutional Neural Networks with Lego Filters" (ICML 2019).
☆140Updated 4 years ago