IBM / online-alt-min
Source code for paper Choromanska et al. -- Beyond Backprop: Online Alternating Minimization with Auxiliary Variables -- http://proceedings.mlr.press/v97/choromanska19a.html
☆24Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for online-alt-min
- ☆45Updated 5 years ago
- Reliable Uncertainty Estimates in Deep Neural Networks using Noise Contrastive Priors☆62Updated 4 years ago
- This repository is no longer maintained. Check☆82Updated 4 years ago
- Limitations of the Empirical Fisher Approximation☆45Updated 4 years ago
- A pytorch implementation for the LSTM experiments in the paper: Why Gradient Clipping Accelerates Training: A Theoretical Justification f…☆44Updated 4 years ago
- Implementation of the paper "Meta-Learning by Adjusting Priors Based on Extended PAC-Bayes Theory", Ron Amit and Ron Meir, ICML 2018☆22Updated 5 years ago
- Delta Orthogonal Initialization for PyTorch☆18Updated 6 years ago
- Code to accompany the paper Radial Bayesian Neural Networks: Beyond Discrete Support In Large-Scale Bayesian Deep Learning☆33Updated 4 years ago
- Repository with code for paper "Inhibited Softmax for Uncertainty Estimation in Neural Networks"☆25Updated 5 years ago
- Gradient Starvation: A Learning Proclivity in Neural Networks☆60Updated 3 years ago
- The Deep Weight Prior, ICLR 2019☆44Updated 3 years ago
- Code for Stochastic Hyperparameter Optimization through Hypernetworks☆23Updated 6 years ago
- The original code for the paper "How to train your MAML" along with a replication of the original "Model Agnostic Meta Learning" (MAML) p…☆40Updated 4 years ago
- ☆25Updated 2 years ago
- Meta-learning learning rates with higher☆12Updated 5 years ago
- ☆61Updated last year
- Implementation of Information Dropout☆39Updated 7 years ago
- Piecewise Linear Functions (PWL) implementation in PyTorch☆48Updated 2 years ago
- Implementation of Bayesian Gradient Descent☆37Updated last year
- ☆70Updated 4 years ago
- Implementation of the models and datasets used in "An Information-theoretic Approach to Distribution Shifts"☆25Updated 3 years ago
- Reproducible code for Augmentation paper☆18Updated 5 years ago
- Growing Dual-Memory Self-Organizing Networks☆25Updated 5 years ago
- SGD and Ordered SGD codes for deep learning, SVM, and logistic regression☆34Updated 4 years ago
- CIFAR-5m dataset☆39Updated 3 years ago
- Code for the paper "Training Binary Neural Networks with Bayesian Learning Rule☆37Updated 2 years ago
- Implementation of "Variational Dropout and the Local Reparameterization Trick" paper with Pytorch☆50Updated 7 years ago
- Code for "Online Learned Continual Compression with Adaptive Quantization Modules"☆27Updated 4 years ago
- Code for "Depth Uncertainty in Neural Networks" (https://arxiv.org/abs/2006.08437)☆72Updated last year
- Implementation of Methods Proposed in Preventing Gradient Attenuation in Lipschitz Constrained Convolutional Networks (NeurIPS 2019)☆33Updated 4 years ago