mmahesh / variants-of-rmsprop-and-adagrad
SC-Adagrad, SC-RMSProp and RMSProp algorithms for training deep networks proposed in
☆14 · Updated 6 years ago
Alternatives and similar repositories for variants-of-rmsprop-and-adagrad
Users who are interested in variants-of-rmsprop-and-adagrad compare it to the libraries listed below.
- Computing various norms/measures on over-parametrized neural networks ☆49 · Updated 6 years ago
- Code for "Adversarial Distillation of Bayesian Neural Network Posteriors" https://arxiv.org/abs/1806.10317 ☆15 · Updated 6 years ago
- Lua implementation of Entropy-SGD ☆82 · Updated 7 years ago
- PyTorch implementation of the Deep Frank-Wolfe Algorithm ☆62 · Updated 4 years ago
- ☆26 · Updated 6 years ago
- ☆63 · Updated 8 years ago
- ☆13 · Updated 7 years ago
- Code to reproduce some of the figures in the paper "On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima" ☆144 · Updated 8 years ago
- SmoothOut: Smoothing Out Sharp Minima to Improve Generalization in Deep Learning ☆23 · Updated 6 years ago
- SGD and Ordered SGD codes for deep learning, SVM, and logistic regression ☆35 · Updated 4 years ago
- code for steinGAN - Learning to Draw Samples: With Application to Amortized MLE for Generative Adversarial Learning ☆27 · Updated 6 years ago
- Provable Robustness of ReLU networks via Maximization of Linear Regions [AISTATS 2019] ☆32 · Updated 4 years ago
- ☆133 · Updated 7 years ago
- Code for "Unifying distillation and privileged information" (ICLR 2016). ☆48 · Updated 9 years ago
- Proximal Backpropagation - a neural network training algorithm that takes implicit instead of explicit gradient steps ☆42 · Updated 6 years ago
- Spherical random features for polynomial kernels ☆10 · Updated 9 years ago
- Collection of algorithms for approximating Fisher Information Matrix for Natural Gradient (and second order method in general) ☆139 · Updated 6 years ago
- ☆27 · Updated 6 years ago
- Code for reproducing the results on the MNIST dataset in the paper "Distributional Smoothing with Virtual Adversarial Training" ☆110 · Updated 8 years ago
- PyTorch implementation of the Accelerated SGD algorithm. ☆215 · Updated 7 years ago
- ☆46 · Updated 7 years ago
- GANs with multiple Discriminators ☆78 · Updated 2 years ago
- Related materials for robust and explainable machine learning ☆48 · Updated 7 years ago
- Implementation of the paper "Meta-Learning by Adjusting Priors Based on Extended PAC-Bayes Theory", Ron Amit and Ron Meir, ICML 2018 ☆22 · Updated 5 years ago
- AAAI & CVPR 2016: Preconditioned Stochastic Gradient Langevin Dynamics (pSGLD) ☆35 · Updated 6 years ago
- Public code for a paper "Lipschitz-Margin Training: Scalable Certification of Perturbation Invariance for Deep Neural Networks." ☆34 · Updated 6 years ago
- Multiplicative Normalizing Flow (MNF) posteriors for variational Bayesian neural networks ☆65 · Updated 4 years ago
- ☆30 · Updated 4 years ago
- Implementation of Methods Proposed in Preventing Gradient Attenuation in Lipschitz Constrained Convolutional Networks (NeurIPS 2019) ☆35 · Updated 5 years ago
- The code for the paper: https://arxiv.org/abs/1806.06317 ☆24 · Updated 6 years ago