f-dangel / hbp
Hessian backpropagation (HBP): PyTorch extension of backpropagation for block-diagonal curvature matrix approximations
☆20Updated last year
Alternatives and similar repositories for hbp:
Users that are interested in hbp are comparing it to the libraries listed below
- Limitations of the Empirical Fisher Approximation☆47Updated 4 years ago
- PyTorch implementation of Hessian Free optimisation☆43Updated 5 years ago
- Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020.☆72Updated 6 months ago
- Monotone operator equilibrium networks☆51Updated 4 years ago
- Code for Understanding and Mitigating Exploding Inverses in Invertible Neural Networks (AISTATS 2021) http://arxiv.org/abs/2006.09347☆29Updated 4 years ago
- Hessian trace estimation using PyTorch and Hutch++☆19Updated 4 years ago
- Regularization, Neural Network Training Dynamics☆14Updated 5 years ago
- ☆47Updated 5 years ago
- Distributed K-FAC Preconditioner for PyTorch☆85Updated last week
- Continuous-time gradient flow for generative modeling and variational inference☆30Updated 6 years ago
- Pytorch implementation of KFAC and E-KFAC (Natural Gradient).☆130Updated 5 years ago
- Code for "Accelerating Natural Gradient with Higher-Order Invariance"☆30Updated 5 years ago
- Code for "'Hey, that's not an ODE:' Faster ODE Adjoints via Seminorms" (ICML 2021)☆86Updated 2 years ago
- ☆53Updated 6 months ago
- ☆31Updated 4 years ago
- Measurements of Three-Level Hierarchical Structure in the Outliers in the Spectrum of Deepnet Hessians (ICML 2019)☆17Updated 5 years ago
- Simple and extensible hypergradient for PyTorch☆16Updated last year
- The official code for Efficient Learning of Generative Models via Finite-Difference Score Matching☆11Updated 2 years ago
- Code base for SRSGD.☆28Updated 4 years ago
- orbital MCMC☆10Updated 3 years ago
- Probabilistic Solution of Differential Equations☆13Updated 2 years ago
- repo for paper: Adaptive Checkpoint Adjoint (ACA) method for gradient estimation in neural ODE☆54Updated 3 years ago
- ☆63Updated last year
- ☆49Updated 4 years ago
- Code for Non-convex Learning via Replica Exchange Stochastic Gradient MCMC, ICML 2020.☆23Updated 4 years ago
- The Full Spectrum of Deepnet Hessians at Scale: Dynamics with SGD Training and Sample Size☆17Updated 5 years ago
- simple JAX-/NumPy-based implementations of NGD with exact/approximate Fisher Information Matrix both in parameter-space and function-spac…☆14Updated 4 years ago
- Code for the Thermodynamic Variational Objective☆26Updated 2 years ago
- Supplementary code for the paper "Meta-Solver for Neural Ordinary Differential Equations" https://arxiv.org/abs/2103.08561☆24Updated 3 years ago
- Experiments for Meta-Learning Symmetries by Reparameterization☆56Updated 3 years ago