noahgolmant / pytorch-lars
"Layer-wise Adaptive Rate Scaling" in PyTorch
☆86 Updated 4 years ago
Alternatives and similar repositories for pytorch-lars:
Users who are interested in pytorch-lars compare it to the libraries listed below.
- An implementation of shampoo ☆74 Updated 7 years ago
- Code for "EigenDamage: Structured Pruning in the Kronecker-Factored Eigenbasis" https://arxiv.org/abs/1905.05934 ☆112 Updated 5 years ago
- This repository is no longer maintained. Check ☆81 Updated 4 years ago
- A Re-implementation of Fixed-update Initialization ☆152 Updated 5 years ago
- Implementation of the reversible residual network in pytorch ☆104 Updated 3 years ago
- ☆62 Updated 4 years ago
- Distributed, mixed-precision training with PyTorch ☆89 Updated 4 years ago
- Code for paper "SWALP: Stochastic Weight Averaging for Low-Precision Training". ☆62 Updated 5 years ago
- [NeurIPS'20] GradAug: A New Regularization Method for Deep Neural Networks ☆93 Updated 4 years ago
- Cheap distillation for convolutional neural networks. ☆33 Updated 6 years ago
- Implementation of soft parameter sharing for neural networks ☆69 Updated 4 years ago
- PyTorch implementation of shake-drop regularization ☆54 Updated 4 years ago
- An official collection of code in different frameworks that reproduces experiments in "Group Normalization" ☆119 Updated 4 years ago
- Simple implementation of the LSUV initialization in PyTorch ☆58 Updated last year
- Filter Response Normalization tested on better ImageNet baselines. ☆35 Updated 4 years ago
- PyProf2: PyTorch Profiling tool ☆82 Updated 4 years ago
- Implements pytorch code for the Accelerated SGD algorithm. ☆215 Updated 7 years ago
- A PyTorch implementation of shake-shake ☆111 Updated 4 years ago
- Simple experiment of Apex (A PyTorch Extension) ☆47 Updated 5 years ago
- Implementation of Octave Convolution from Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks with Octave Convol… ☆57 Updated 5 years ago
- Unofficial PyTorch Implementation of EvoNorm ☆121 Updated 3 years ago
- homura is a library for fast prototyping DL research ☆107 Updated 2 years ago
- This repository contains code to replicate the experiments given in NeurIPS 2019 paper "One ticket to win them all: generalizing lottery … ☆51 Updated 8 months ago
- Code used to generate the results appearing in "Train longer, generalize better: closing the generalization gap in large batch training o… ☆149 Updated 7 years ago
- Utilities for Pytorch ☆89 Updated 2 years ago
- This project is the Torch implementation of our accepted AAAI 2018 paper : orthogonal weight normalization method for solving orthogonali… ☆57 Updated 5 years ago
- [NeurIPS '18] "Can We Gain More from Orthogonality Regularizations in Training Deep CNNs?" Official Implementation. ☆128 Updated 3 years ago
- Model Parallelism for pytorch training multiple network on multiple GPUs. ☆28 Updated 7 years ago
- On Network Design Spaces for Visual Recognition ☆94 Updated 4 years ago
- Code to reproduce some of the figures in the paper "On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima" ☆139 Updated 7 years ago