noahgolmant / pytorch-lars

"Layer-wise Adaptive Rate Scaling" in PyTorch

☆87

Alternatives and similar repositories for pytorch-lars

Users that are interested in pytorch-lars are comparing it to the libraries listed below

moskomule / shampoo.pytorch
An implementation of shampoo
☆77 Updated 7 years ago
hongyi-zhang / Fixup
A Re-implementation of Fixed-update Initialization
☆152 Updated 6 years ago
alecwangcq / EigenDamage-Pytorch
Code for "EigenDamage: Structured Pruning in the Kronecker-Factored Eigenbasis" https://arxiv.org/abs/1905.05934
☆113 Updated 5 years ago
ducha-aiki / LSUV-pytorch
Simple implementation of the LSUV initialization in PyTorch
☆58 Updated last year
ppwwyyxx / GroupNorm-reproduce
An official collection of code in different frameworks that reproduces experiments in "Group Normalization"
☆118 Updated 4 years ago
toiaydcdyywlhzvlob / backpack
This repository is no longer maintained. Check
☆81 Updated 5 years ago
ag14774 / diffdist
☆62 Updated 5 years ago
owruby / shake-drop_pytorch
PyTorch implementation of shake-drop regularization
☆55 Updated 5 years ago
hysts / pytorch_shake_shake
A PyTorch implementation of shake-shake
☆111 Updated 5 years ago
facebookresearch / nds
On Network Design Spaces for Visual Recognition
☆96 Updated 5 years ago
moskomule / homura
homura is a library for fast prototyping DL research
☆106 Updated 3 years ago
stevenygd / SWALP
Code for paper "SWALP: Stochastic Weight Averaging for Low-Precision Training".
☆62 Updated 6 years ago
tbung / pytorch-revnet
Implementation of the reversible residual network in pytorch
☆105 Updated 3 years ago
varungohil / Generalizing-Lottery-Tickets
This repository contains code to replicate the experiments given in NeurIPS 2019 paper "One ticket to win them all: generalizing lottery …
☆51 Updated last year
BayesWatch / pytorch-prunes
Code for https://arxiv.org/abs/1810.04622
☆141 Updated 5 years ago
eladhoffer / utils.pytorch
Utilities for Pytorch
☆89 Updated 2 years ago
digantamisra98 / EvoNorm
Unofficial PyTorch Implementation of EvoNorm
☆122 Updated 3 years ago
BayesWatch / pytorch-moonshine
Cheap distillation for convolutional neural networks.
☆33 Updated 6 years ago
gan3sh500 / octaveconv-pytorch
Implementation of Octave Convolution from Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks with Octave Convol…
☆57 Updated 6 years ago
lolemacs / soft-sharing
Implementation of soft parameter sharing for neural networks
☆69 Updated 4 years ago
JiJingYu / delta_orthogonal_init_pytorch
Delta Orthogonal Initialization for PyTorch
☆18 Updated 7 years ago
cc-hpc-itwm / GradVis
☆39 Updated 5 years ago
ppwwyyxx / FRN-on-common-ImageNet-baseline
Filter Response Normalization tested on better ImageNet baselines.
☆35 Updated 5 years ago
znxlwm / pytorch-apex-experiment
Simple experiment of Apex (A PyTorch Extension)
☆47 Updated 5 years ago
vinbhaskara / adams
Exploiting Uncertainty of Loss Landscape for Stochastic Optimization
☆15 Updated 6 years ago
csrhddlam / pytorch-checkpoint
☆165 Updated 6 years ago
keskarnitish / large-batch-training
Code to reproduce some of the figures in the paper "On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima"
☆144 Updated 8 years ago
taoyang1122 / GradAug
[NeurIPS'20] GradAug: A New Regularization Method for Deep Neural Networks
☆94 Updated 4 years ago
rahulkidambi / AccSGD
Implements pytorch code for the Accelerated SGD algorithm.
☆215 Updated 7 years ago
moskomule / cca.pytorch
CCAs for looking into DNNs
☆70 Updated 5 years ago