IBM / online-alt-min
Source code for the paper by Choromanska et al., "Beyond Backprop: Online Alternating Minimization with Auxiliary Variables": http://proceedings.mlr.press/v97/choromanska19a.html
☆24 Updated 4 years ago
Related projects:
- This repository is no longer maintained. Check ☆82 Updated 4 years ago
- ☆45 Updated 4 years ago
- The Deep Weight Prior, ICLR 2019 ☆44 Updated 3 years ago
- Limitations of the Empirical Fisher Approximation ☆45 Updated 4 years ago
- Delta Orthogonal Initialization for PyTorch ☆18 Updated 6 years ago
- A PyTorch implementation for the LSTM experiments in the paper: Why Gradient Clipping Accelerates Training: A Theoretical Justification f… ☆44 Updated 4 years ago
- Code to accompany the paper Radial Bayesian Neural Networks: Beyond Discrete Support In Large-Scale Bayesian Deep Learning ☆33 Updated 4 years ago
- CIFAR-5m dataset ☆39 Updated 3 years ago
- Reliable Uncertainty Estimates in Deep Neural Networks using Noise Contrastive Priors ☆62 Updated 4 years ago
- Code for Self-Tuning Networks (ICLR 2019) https://arxiv.org/abs/1903.03088 ☆53 Updated 5 years ago
- The Singular Values of Convolutional Layers ☆71 Updated 5 years ago
- Code base for SRSGD. ☆28 Updated 4 years ago
- Implementation of Information Dropout ☆39 Updated 7 years ago
- Computing various norms/measures on over-parametrized neural networks ☆48 Updated 5 years ago
- ☆53 Updated 6 years ago
- Piecewise Linear Functions (PWL) implementation in PyTorch ☆47 Updated 2 years ago
- Deep Neural Networks Entropy from Replicas ☆31 Updated 4 years ago
- Meta-learning learning rates with higher ☆12 Updated 4 years ago
- Gradient Starvation: A Learning Proclivity in Neural Networks ☆59 Updated 3 years ago
- The original code for the paper "How to train your MAML" along with a replication of the original "Model Agnostic Meta Learning" (MAML) p… ☆40 Updated 3 years ago
- Implementation of Methods Proposed in Preventing Gradient Attenuation in Lipschitz Constrained Convolutional Networks (NeurIPS 2019) ☆33 Updated 4 years ago
- Padé Activation Units: End-to-end Learning of Activation Functions in Deep Neural Network ☆62 Updated 3 years ago
- [JMLR] TRADES + random smoothing for certifiable robustness ☆14 Updated 4 years ago
- PyTorch code for training neural networks without global back-propagation ☆160 Updated 4 years ago
- SGD and Ordered SGD codes for deep learning, SVM, and logistic regression ☆34 Updated 4 years ago
- Repository with code for paper "Inhibited Softmax for Uncertainty Estimation in Neural Networks" ☆25 Updated 5 years ago
- The code for the paper: https://arxiv.org/abs/1806.06317 ☆24 Updated 5 years ago
- Lua implementation of Entropy-SGD ☆79 Updated 6 years ago
- Code for the paper "Training Binary Neural Networks with Bayesian Learning Rule" ☆37 Updated 2 years ago
- ☆14 Updated 4 years ago