artur-deluca / landscapeviz
Visualizing the loss landscape of Fully-Connected Neural Networks
☆44 · Updated last year
Alternatives and similar repositories for landscapeviz:
Users who are interested in landscapeviz are comparing it to the libraries listed below.
- paper lists and information on mean-field theory of deep learning ☆75 · Updated 5 years ago
- pyhessian is a TensorFlow module which can be used to estimate Hessian matrices ☆24 · Updated 3 years ago
- Limitations of the Empirical Fisher Approximation ☆47 · Updated 4 years ago
- Code for the paper: "Tensor Programs II: Neural Tangent Kernel for Any Architecture" ☆103 · Updated 4 years ago
- Code base for SRSGD. ☆28 · Updated 4 years ago
- Create animations for the optimization trajectory of neural nets ☆144 · Updated last year
- Delta Orthogonal Initialization for PyTorch ☆18 · Updated 6 years ago
- PyTorch implementation of FIM and empirical FIM ☆58 · Updated 6 years ago
- Structured matrices for compressing neural networks ☆66 · Updated last year
- Monotone operator equilibrium networks ☆51 · Updated 4 years ago
- Code for "Training Deep Energy-Based Models with f-Divergence Minimization" ICML 2020 ☆36 · Updated last year
- Collection of algorithms for approximating the Fisher Information Matrix for Natural Gradient (and second-order methods in general) ☆136 · Updated 5 years ago
- Padé Activation Units: End-to-end Learning of Activation Functions in Deep Neural Network ☆63 · Updated 3 years ago
- Easy-to-use AdaHessian optimizer (PyTorch) ☆77 · Updated 4 years ago
- PyTorch-SSO: Scalable Second-Order methods in PyTorch ☆144 · Updated last year
- Repository containing PyTorch code for EKFAC and K-FAC preconditioners. ☆141 · Updated last year
- CIFAR-5m dataset ☆38 · Updated 4 years ago
- Hessian spectral density estimation in TF and Jax ☆120 · Updated 4 years ago
- Geometric Certifications of Neural Nets ☆41 · Updated 2 years ago
- Code to implement the AND-mask and geometric mean to do gradient based optimization, from the paper "Learning explanations that are hard … ☆39 · Updated 4 years ago
- Computing various norms/measures on over-parametrized neural networks ☆49 · Updated 6 years ago
- ☆57 · Updated last year
- Code to accompany the paper Radial Bayesian Neural Networks: Beyond Discrete Support In Large-Scale Bayesian Deep Learning ☆33 · Updated 4 years ago
- ☆15 · Updated 4 years ago
- NTK reading group ☆88 · Updated 5 years ago
- repo for paper: Adaptive Checkpoint Adjoint (ACA) method for gradient estimation in neural ODE ☆54 · Updated 3 years ago
- Hypergradient descent ☆143 · Updated 8 months ago
- ☆53 · Updated 6 months ago
- Reparameterize your PyTorch modules ☆70 · Updated 4 years ago
- Implementation of Methods Proposed in Preventing Gradient Attenuation in Lipschitz Constrained Convolutional Networks (NeurIPS 2019) ☆33 · Updated 4 years ago