tyohei / chainerkfac
A Chainer extension for K-FAC
☆20Updated 5 years ago
Alternatives and similar repositories for chainerkfac:
Users who are interested in chainerkfac are comparing it to the libraries listed below:
- Limitations of the Empirical Fisher Approximation☆47Updated 4 years ago
- This repository is no longer maintained. Check☆81Updated 4 years ago
- Regularization, Neural Network Training Dynamics☆14Updated 5 years ago
- Distributed K-FAC Preconditioner for PyTorch☆85Updated last week
- TensorFlow implementation of "noisy K-FAC" and "noisy EK-FAC".☆60Updated 6 years ago
- Lua implementation of Entropy-SGD☆81Updated 6 years ago
- ☆82Updated 5 years ago
- Natural Gradient, Variational Inference☆29Updated 5 years ago
- Repository containing PyTorch code for EKFAC and K-FAC preconditioners.☆141Updated last year
- Code for "Accelerating Natural Gradient with Higher-Order Invariance"☆30Updated 5 years ago
- PyTorch implementation of KFAC and E-KFAC (Natural Gradient).☆130Updated 5 years ago
- Collection of algorithms for approximating the Fisher Information Matrix for Natural Gradient (and second-order methods in general)☆137Updated 5 years ago
- ☆36Updated 3 years ago
- Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020.☆72Updated 6 months ago
- ☆74Updated 5 years ago
- Code for the paper "SWALP: Stochastic Weight Averaging for Low-Precision Training".☆62Updated 5 years ago
- ☆132Updated 7 years ago
- Computing various norms/measures on over-parametrized neural networks☆49Updated 6 years ago
- Structured matrices for compressing neural networks☆66Updated last year
- A PyTorch implementation of the paper "Decoupled Parallel Backpropagation with Convergence Guarantee"☆29Updated 6 years ago
- Computing the eigenvalues of Neural Tangent Kernel and Conjugate Kernel (aka NNGP kernel) over the boolean cube☆48Updated 5 years ago
- Estimating Gradients for Discrete Random Variables by Sampling without Replacement☆40Updated 5 years ago
- Code base for SRSGD.☆28Updated 4 years ago
- Monotone operator equilibrium networks☆51Updated 4 years ago
- Experiments with Neural ODEs and Adversarial Attacks☆44Updated 6 years ago
- Code for "A Spectral Approach to Gradient Estimation for Implicit Distributions" (ICML'18)☆32Updated last year
- PyTorch implementation of Hessian Free optimisation☆43Updated 5 years ago
- ☆47Updated 5 years ago
- Convolutional Neural Tangent Kernel☆109Updated 5 years ago
- SGD and Ordered SGD codes for deep learning, SVM, and logistic regression☆35Updated 4 years ago