noahgolmant / pytorch-larsLinks
"Layer-wise Adaptive Rate Scaling" in PyTorch
☆87Updated 4 years ago
Alternatives and similar repositories for pytorch-lars
Users that are interested in pytorch-lars are comparing it to the libraries listed below
Sorting:
- Code for "EigenDamage: Structured Pruning in the Kronecker-Factored Eigenbasis" https://arxiv.org/abs/1905.05934☆113Updated 5 years ago
- An implementation of shampoo☆74Updated 7 years ago
- A Re-implementation of Fixed-update Initialization☆153Updated 6 years ago
- ☆62Updated 5 years ago
- This repository is no longer maintained. Check☆81Updated 5 years ago
- An official collection of code in different frameworks that reproduces experiments in "Group Normalization"☆118Updated 4 years ago
- [NeurIPS'20] GradAug: A New Regularization Method for Deep Neural Networks☆94Updated 4 years ago
- Delta Orthogonal Initialization for PyTorch☆18Updated 6 years ago
- Utilities for Pytorch☆89Updated 2 years ago
- Code for paper "SWALP: Stochastic Weight Averaging forLow-Precision Training".☆62Updated 6 years ago
- Unofficial PyTorch Implementation of EvoNorm☆122Updated 3 years ago
- Filter Response Normalization tested on better ImageNet baselines.☆35Updated 5 years ago
- PyTorch implementation of shake-drop regularization☆54Updated 5 years ago
- PyProf2: PyTorch Profiling tool☆82Updated 5 years ago
- Simple experiment of Apex (A PyTorch Extension)☆47Updated 5 years ago
- A PyTorch implementation of the paper "Decoupled Parallel Backpropagation with Convergence Guarantee"☆29Updated 6 years ago
- A PyTorch implementation of shake-shake☆111Updated 5 years ago
- CCAs for looking into DNNs☆70Updated 4 years ago
- On Network Design Spaces for Visual Recognition☆95Updated 5 years ago
- Code used to generate the results appearing in "Train longer, generalize better: closing the generalization gap in large batch training o…☆149Updated 8 years ago
- [NeurIPS 2020 Oral] Is normalization indispensable for training deep neural networks?☆34Updated 3 years ago
- Code to reproduce some of the figures in the paper "On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima"☆144Updated 8 years ago
- Implementation of soft parameter sharing for neural networks☆69Updated 4 years ago
- PyTorch DataLoader processed in multiple remote computation machines for heavy data processings☆67Updated 5 years ago
- [ICLR 2019] ProbGAN: Towards Probabilistic GAN with Theoretical Guarantees☆32Updated 5 years ago
- Zero-Shot Knowledge Distillation in Deep Networks☆67Updated 3 years ago
- Tensorflow implementation of S4L: Self-Supervised Semi-Supervised Learning☆94Updated 5 years ago
- Code release for paper "Random Search and Reproducibility for NAS"☆167Updated 5 years ago
- ☆165Updated 6 years ago
- "Learning Rate Dropout" in PyTorch☆34Updated 5 years ago