mmahesh / variants-of-rmsprop-and-adagradLinks

SC-Adagrad, SC-RMSProp and RMSProp algorithms for training deep networks proposed in

☆14

Alternatives and similar repositories for variants-of-rmsprop-and-adagrad

Users that are interested in variants-of-rmsprop-and-adagrad are comparing it to the libraries listed below

Sorting:

bneyshabur / over-parametrization
Computing various norms/measures on over-parametrized neural networks
☆49Updated 6 years ago
wangkua1 / apd_public
Code for "Adversarial Distillation of Bayesian Neural Network Posteriors" https://arxiv.org/abs/1806.10317
☆15Updated 6 years ago
ucla-vision / entropy-sgd
Lua implementation of Entropy-SGD
☆82Updated 7 years ago
oval-group / dfw
Implementation of the Deep Frank-Wolfe Algorithm -- Pytorch
☆62Updated 4 years ago
fKunstner / fast-individual-gradients-with-autodiff
☆26Updated 6 years ago
siddharth-agrawal / Generative-Moment-Matching-Networks
☆63Updated 8 years ago
BB-UCL / Lasagne
☆13Updated 7 years ago
keskarnitish / large-batch-training
Code to reproduce some of the figures in the paper "On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima"
☆144Updated 8 years ago
wenwei202 / smoothout
SmoothOut: Smoothing Out Sharp Minima to Improve Generalization in Deep Learning
☆23Updated 6 years ago
k9k2 / qSGD
SGD and Ordered SGD codes for deep learning, SVM, and logistic regression
☆35Updated 4 years ago
DartML / SteinGAN
code for steinGAN - Learning to Draw Samples: With Application to Amortized MLE for Generative Adversarial Learning
☆27Updated 6 years ago
max-andr / provable-robustness-max-linear-regions
Provable Robustness of ReLU networks via Maximization of Linear Regions [AISTATS 2019]
☆32Updated 4 years ago
yaroslavvb / kfac_pytorch
☆133Updated 7 years ago
lopezpaz / distillation_privileged_information
Code for "Unifying distillation and privileged information" (ICLR 2016).
☆48Updated 9 years ago
tfrerix / proxprop
Proximal Backpropagation - a neural network training algorithm that takes implicit instead of explicit gradient steps
☆42Updated 6 years ago
felixyu / SRF
Spherical random features for polynomial kernels
☆10Updated 9 years ago
wiseodd / natural-gradients
Collection of algorithms for approximating Fisher Information Matrix for Natural Gradient (and second order method in general)
☆139Updated 6 years ago
wendazhou / nnet-compression-generalization
☆27Updated 6 years ago
takerum / vat
Code for reproducing the results on the MNIST dataset in the paper "Distributional Smoothing with Virtual Adversarial Training"
☆110Updated 8 years ago
rahulkidambi / AccSGD
Implements pytorch code for the Accelerated SGD algorithm.
☆215Updated 7 years ago
vsyrgkanis / optimistic_GAN_training
☆46Updated 7 years ago
iDurugkar / GMAN
GANs with multiple Discriminators
☆78Updated 2 years ago
dongyp13 / Robust-and-Explainable-Machine-Learning
Related materials for robust and explainable machine learning
☆48Updated 7 years ago
ron-amit / meta-learning-adjusting-priors
Implementation of the paper "Meta-Learning by Adjusting Priors Based on Extended PAC-Bayes Theory", Ron Amit and Ron Meir, ICML 2018
☆22Updated 5 years ago
ChunyuanLI / pSGLD
AAAI & CVPR 2016: Preconditioned Stochastic Gradient Langevin Dynamics (pSGLD)
☆35Updated 6 years ago
ytsmiling / lmt
Public code for a paper "Lipschitz-Margin Training: Scalable Certification of Perturbation Invariance for Deep Neural Networks."
☆34Updated 6 years ago
AMLab-Amsterdam / MNF_VBNN
Multiplicative Normalizing Flow (MNF) posteriors for variational Bayesian neural networks
☆65Updated 4 years ago
eugenium / DGL
☆30Updated 4 years ago
ColinQiyangLi / LConvNet
Implementation of Methods Proposed in Preventing Gradient Attenuation in Lipschitz Constrained Convolutional Networks (NeurIPS 2019)
☆35Updated 5 years ago
BaoWangMath / LaplacianSmoothing-GradientDescent
The code for the paper: https://arxiv.org/abs/1806.06317
☆24Updated 6 years ago