f-dangel / hbp
Hessian backpropagation (HBP): PyTorch extension of backpropagation for block-diagonal curvature matrix approximations
☆20Updated 2 years ago
Alternatives and similar repositories for hbp:
Users that are interested in hbp are comparing it to the libraries listed below
- Monotone operator equilibrium networks☆51Updated 4 years ago
- Limitations of the Empirical Fisher Approximation☆47Updated 2 months ago
- Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020.☆75Updated 9 months ago
- ☆30Updated 4 years ago
- ☆47Updated 5 years ago
- Code for Understanding and Mitigating Exploding Inverses in Invertible Neural Networks (AISTATS 2021) http://arxiv.org/abs/2006.09347☆30Updated 4 years ago
- ☆53Updated 9 months ago
- PyTorch implementation of Hessian Free optimisation☆43Updated 5 years ago
- Supplementary code for the paper "Meta-Solver for Neural Ordinary Differential Equations" https://arxiv.org/abs/2103.08561☆25Updated 4 years ago
- Distributed K-FAC preconditioner for PyTorch☆85Updated last week
- Code for "'Hey, that's not an ODE:' Faster ODE Adjoints via Seminorms" (ICML 2021)☆87Updated 2 years ago
- orbital MCMC☆10Updated 3 years ago
- Relative gradient optimization of the Jacobian term in unsupervised deep learning, NeurIPS 2020☆21Updated 4 years ago
- Code for "Accelerating Natural Gradient with Higher-Order Invariance"☆30Updated 5 years ago
- Convex potential flows☆83Updated 3 years ago
- Implicit Deep Adaptive Design (iDAD): Policy-Based Experimental Design without Likelihoods☆19Updated 3 years ago
- ☆49Updated 4 years ago
- Regularization, Neural Network Training Dynamics☆14Updated 5 years ago
- Stochastic Gradient Langevin Dynamics for Bayesian learning☆31Updated 3 years ago
- Code for Non-convex Learning via Replica Exchange Stochastic Gradient MCMC, ICML 2020.☆25Updated 4 years ago
- Measurements of Three-Level Hierarchical Structure in the Outliers in the Spectrum of Deepnet Hessians (ICML 2019)☆17Updated 6 years ago
- The Full Spectrum of Deepnet Hessians at Scale: Dynamics with SGD Training and Sample Size☆17Updated 5 years ago
- Pytorch implementation of KFAC and E-KFAC (Natural Gradient).☆132Updated 5 years ago
- Probabilistic Solution of Differential Equations☆13Updated 2 years ago
- Code accompanying VarGrad: A Low-Variance Gradient Estimator for Variational Inference☆12Updated 4 years ago
- Hessian trace estimation using PyTorch and Hutch++☆19Updated 4 years ago
- ☆15Updated 4 years ago
- Adaptive gradient descent without descent☆47Updated 3 years ago
- Euclidean Wasserstein-2 optimal transportation☆47Updated last year
- simple JAX-/NumPy-based implementations of NGD with exact/approximate Fisher Information Matrix both in parameter-space and function-spac…☆14Updated 4 years ago