jotaf98 / pytorch-curveball
A second-order optimizer for deep networks
☆24 · Updated 5 years ago
Alternatives and similar repositories for pytorch-curveball:
Users interested in pytorch-curveball compare it to the libraries listed below.
- An Implementation of "Small steps and giant leaps: Minimal Newton solvers for Deep Learning" In pytorch ☆21 · Updated 6 years ago
- ☆27 · Updated 4 years ago
- This repository is no longer maintained. Check ☆81 · Updated 4 years ago
- ☆26 · Updated 5 years ago
- Sliced Wasserstein Generator ☆37 · Updated 6 years ago
- Estimating Gradients for Discrete Random Variables by Sampling without Replacement ☆39 · Updated 4 years ago
- Like Moving MNIST, but way more flexible ☆24 · Updated 4 years ago
- Recurrent Back Propagation, Back Propagation Through Optimization, ICML 2018 ☆41 · Updated 6 years ago
- An implementation of shampoo ☆74 · Updated 6 years ago
- Implementation of Methods Proposed in Preventing Gradient Attenuation in Lipschitz Constrained Convolutional Networks (NeurIPS 2019) ☆33 · Updated 4 years ago
- ☆40 · Updated last year
- "Learning Discrete and Continuous Factors of Data via Alternating Disentanglement" accepted at ICML2019 ☆21 · Updated 5 years ago
- ☆31 · Updated 4 years ago
- ☆42 · Updated 5 years ago
- Percentile computation for pytorch ☆20 · Updated 4 years ago
- (Batched) advanced indexing for PyTorch. ☆53 · Updated 3 weeks ago
- Implementation of iterative inference in deep latent variable models ☆43 · Updated 5 years ago
- A pytorch implementation for the LSTM experiments in the paper: Why Gradient Clipping Accelerates Training: A Theoretical Justification f… ☆43 · Updated 4 years ago
- Low-variance, efficient and unbiased gradient estimation for optimizing models with binary latent variables. (ICLR 2019) ☆28 · Updated 5 years ago
- A PyTorch implementation of Conditional PixelCNNs ☆27 · Updated 6 years ago
- ☆21 · Updated 4 years ago
- implements optimal transport algorithms in pytorch ☆92 · Updated 2 years ago
- Monotone operator equilibrium networks ☆51 · Updated 4 years ago
- Partially Adaptive Momentum Estimation method in the paper "Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep … ☆39 · Updated last year
- EfficientMORL (ICML'21) ☆22 · Updated 3 years ago
- A discrete sequential VAE ☆38 · Updated 4 years ago
- TensorFlow implementation of "noisy K-FAC" and "noisy EK-FAC". ☆60 · Updated 6 years ago
- Matrix square root with gradient support for PyTorch ☆67 · Updated 2 years ago
- Implementation of the Deep Frank-Wolfe Algorithm -- Pytorch ☆62 · Updated 3 years ago
- Implementation of the reversible residual network in pytorch ☆102 · Updated 3 years ago