NVlabs / AdaBatchLinks

AdaBatch: Adaptive Batch Sizes for Training Deep Neural Networks

☆42

Alternatives and similar repositories for AdaBatch

Users that are interested in AdaBatch are comparing it to the libraries listed below

Sorting:

BayesWatch / deficient-efficient
Successfully training approximations to full-rank matrices for efficiency in deep learning.
☆17Updated 4 years ago
eladhoffer / norm_matters
☆23Updated 6 years ago
evcu / pytorchpruner
☆22Updated 7 years ago
liamcli / darts_asha
Code release to reproduce ASHA experiments from "Random Search and Reproducibility for NAS."
☆22Updated 5 years ago
tbachlechner / ReZero-examples
PyTorch Examples repo for "ReZero is All You Need: Fast Convergence at Large Depth"
☆61Updated 11 months ago
ducha-aiki / LSUV-pytorch
Simple implementation of the LSUV initialization in PyTorch
☆58Updated last year
BayesWatch / pytorch-blockswap
Code for BlockSwap (ICLR 2020).
☆33Updated 4 years ago
JiJingYu / delta_orthogonal_init_pytorch
Delta Orthogonal Initialization for PyTorch
☆18Updated 7 years ago
uclaml / Padam
Partially Adaptive Momentum Estimation method in the paper "Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep …
☆39Updated 2 years ago
noahgolmant / pytorch-lars
"Layer-wise Adaptive Rate Scaling" in PyTorch
☆87Updated 4 years ago
stevenygd / SWALP
Code for paper "SWALP: Stochastic Weight Averaging forLow-Precision Training".
☆62Updated 6 years ago
minhtannguyen / SRSGD
Code base for SRSGD.
☆28Updated 5 years ago
noahgolmant / pytorch-lr-dropout
"Learning Rate Dropout" in PyTorch
☆34Updated 5 years ago
eladhoffer / fix_your_classifier
☆34Updated 6 years ago
wenwei202 / autogrow
AutoGrow: Automatic Layer Growing in Deep Convolutional Networks (KDD 2020)
☆39Updated 6 years ago
RUSH-LAB / LSH_Memory
One-Shot Learning using Nearest-Neighbor Search (NNS) and Locality-Sensitive Hashing LSH
☆74Updated 7 years ago
lolemacs / soft-sharing
Implementation of soft parameter sharing for neural networks
☆70Updated 4 years ago
xternalz / SDPoint
Stochastic Downsampling for Cost-Adjustable Inference and Improved Regularization in Convolutional Networks
☆18Updated 5 years ago
moskomule / shampoo.pytorch
An implementation of shampoo
☆75Updated 7 years ago
fabio-deep / ReZero-ResNet
Unofficial pytorch implementation of ReZero in ResNet
☆23Updated 5 years ago
znxlwm / pytorch-apex-experiment
Simple experiment of Apex (A PyTorch Extension)
☆47Updated 5 years ago
kcyu2014 / multi-model-forgetting
ICML2019 Accepted Paper. Overcoming Multi-Model Forgetting
☆14Updated 6 years ago
keivanalizadeh / ButterflyTransform
☆41Updated 4 years ago
vinbhaskara / adams
Exploiting Uncertainty of Loss Landscape for Stochastic Optimization
☆15Updated 6 years ago
biswajitsc / sparse-embed
Code for paper 'Minimizing FLOPs to Learn Efficient Sparse Representations' published at ICLR 2020
☆20Updated 5 years ago
cc-hpc-itwm / GradVis
☆39Updated 5 years ago
eBay / AutoOpt
Automatic and Simultaneous Adjustment of Learning Rate and Momentum for Stochastic Gradient Descent
☆45Updated 4 years ago
shivram1987 / diffGrad
diffGrad: An Optimization Method for Convolutional Neural Networks
☆55Updated 2 years ago
xjtushujun / Meta-weight-net_class-imbalance
NeurIPS'19: Meta-Weight-Net: Learning an Explicit Mapping For Sample Weighting (Pytorch implementation for class imbalance).
☆34Updated 5 years ago
vfdev-5 / UDA-pytorch
Unsupervised Data Augmentation experiments in PyTorch
☆60Updated 5 years ago