Healbadbad / curveball-pytorchLinks
An Implementation of "Small steps and giant leaps: Minimal Newton solvers for Deep Learning" In pytorch
☆21Updated 6 years ago
Alternatives and similar repositories for curveball-pytorch
Users that are interested in curveball-pytorch are comparing it to the libraries listed below
Sorting:
- ☆23Updated 6 years ago
- Generalized Framework for PyTorch☆32Updated 3 years ago
- ☆34Updated 6 years ago
- An implementation of shampoo☆74Updated 7 years ago
- Implementation of the Budgeted Super Networks☆25Updated 6 years ago
- A second-order optimizer for deep networks☆25Updated 5 years ago
- SGD and Ordered SGD codes for deep learning, SVM, and logistic regression☆35Updated 4 years ago
- TensorFlow implementation of (Momentum) Stochastic Variance-Adapted Gradient.☆44Updated 7 years ago
- Implementation of Methods Proposed in Preventing Gradient Attenuation in Lipschitz Constrained Convolutional Networks (NeurIPS 2019)☆35Updated 5 years ago
- Partially Adaptive Momentum Estimation method in the paper "Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep …☆39Updated 2 years ago
- ☆25Updated 5 years ago
- pytorch implementation of "Contrastive Multiview Coding", "Momentum Contrast for Unsupervised Visual Representation Learning", and "Unsup…☆18Updated 5 years ago
- This repository is no longer maintained. Check☆81Updated 5 years ago
- Like Moving MNIST, but way more flexible☆24Updated 4 years ago
- ☆46Updated 7 years ago
- For the reproduction of research by Agostinelli et al. Learning Activation Functions to Improve Deep Neural Networks. http://arxiv.org/ab…☆19Updated 10 years ago
- Torch implementation of orthoreg.☆15Updated 3 years ago
- Second-order optimiser for deep networks☆76Updated 6 years ago
- Implementation of the Deep Frank-Wolfe Algorithm -- Pytorch☆62Updated 4 years ago
- TensorFlow implementation of "noisy K-FAC" and "noisy EK-FAC".☆60Updated 6 years ago
- This repo contains the code used for NeurIPS 2019 paper "Asymmetric Valleys: Beyond Sharp and Flat Local Minima".☆14Updated 5 years ago
- Codes for the paper "Deep Neural Networks with Multi-Branch Architectures Are Less Non-Convex"☆20Updated 4 years ago
- ☆33Updated 6 years ago
- The Singular Values of Convolutional Layers☆72Updated 6 years ago
- This project is the Torch implementation of our accepted AAAI 2018 paper : orthogonal weight normalization method for solving orthogonali…☆57Updated 5 years ago
- Lua implementation of Entropy-SGD☆82Updated 7 years ago
- A summary of my recently surveyed papers. Some papers on Arxiv with unimpressive results are not included.☆25Updated 7 years ago
- Recurrent Back Propagation, Back Propagation Through Optimization, ICML 2018☆42Updated 6 years ago
- Code for "On the Relation Between the Sharpest Directions of DNN Loss and the SGD Step Length", ICLR 2019☆11Updated 2 years ago
- Code for BlockSwap (ICLR 2020).☆33Updated 4 years ago