jotaf98 / pytorch-curveball
A second-order optimizer for deep networks
☆24Updated 5 years ago
Alternatives and similar repositories for pytorch-curveball:
Users that are interested in pytorch-curveball are comparing it to the libraries listed below
- This repository is no longer maintained. Check☆81Updated 4 years ago
- An Implementation of "Small steps and giant leaps: Minimal Newton solvers for Deep Learning" In pytorch☆21Updated 6 years ago
- Like Moving MNIST, but way more flexible☆24Updated 4 years ago
- Recurrent Back Propagation, Back Propagation Through Optimization, ICML 2018☆41Updated 6 years ago
- ☆41Updated 2 years ago
- Estimating Gradients for Discrete Random Variables by Sampling without Replacement☆40Updated 5 years ago
- Sliced Wasserstein Generator☆37Updated 7 years ago
- ☆27Updated 4 years ago
- Partially Adaptive Momentum Estimation method in the paper "Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep …☆39Updated 2 years ago
- (Batched) advanced indexing for PyTorch.☆53Updated 3 months ago
- Masked Convolutional Flow☆59Updated 5 years ago
- An implementation of shampoo☆74Updated 7 years ago
- Monotone operator equilibrium networks☆51Updated 4 years ago
- Reparameterize your PyTorch modules☆71Updated 4 years ago
- Limitations of the Empirical Fisher Approximation☆47Updated last month
- Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020.☆75Updated 8 months ago
- Pytorch implementation of the Power Spherical distribution☆74Updated 9 months ago
- ☆30Updated 4 years ago
- ☆42Updated 5 years ago
- Efficient reservoir sampling implementation for PyTorch☆106Updated 3 years ago
- A PyTorch implementation of Conditional PixelCNNs☆27Updated 7 years ago
- Implementation of "Variational Dropout and the Local Reparameterization Trick" paper with Pytorch☆49Updated 7 years ago
- Code for Variational Laplace Autoencoders☆54Updated last year
- Implementation of the Deep Frank-Wolfe Algorithm -- Pytorch☆62Updated 4 years ago
- ☆21Updated 5 years ago
- Implementation of Methods Proposed in Preventing Gradient Attenuation in Lipschitz Constrained Convolutional Networks (NeurIPS 2019)☆34Updated 4 years ago
- Code for Self-Tuning Networks (ICLR 2019) https://arxiv.org/abs/1903.03088☆53Updated 5 years ago
- Riemannian approach to batch normalization☆18Updated 7 years ago
- Differentiable bitonic sorting☆140Updated 4 years ago
- ☆34Updated 6 years ago