gd-zhang / Weight-Decay
Regularization, Neural Network Training Dynamics
☆14 Updated 5 years ago
Alternatives and similar repositories for Weight-Decay
Users who are interested in Weight-Decay are comparing it to the libraries listed below.
- Natural Gradient, Variational Inference ☆29 Updated 5 years ago
- TensorFlow implementation of "noisy K-FAC" and "noisy EK-FAC". ☆60 Updated 6 years ago
- Monotone operator equilibrium networks ☆53 Updated 5 years ago
- Limitations of the Empirical Fisher Approximation ☆47 Updated 6 months ago
- Optimization with orthogonal constraints and on general manifolds ☆130 Updated 5 years ago
- Code for the Thermodynamic Variational Objective ☆26 Updated 3 years ago
- Code for "Accelerating Natural Gradient with Higher-Order Invariance" ☆30 Updated 6 years ago
- Pytorch implementation of KFAC and E-KFAC (Natural Gradient). ☆132 Updated 6 years ago
- ☆36 Updated 4 years ago
- Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020. ☆76 Updated last year
- ☆80 Updated 4 years ago
- Repository containing Pytorch code for EKFAC and K-FAC preconditioners. ☆146 Updated 2 years ago
- hessian in pytorch ☆187 Updated 4 years ago
- Experiment code for "Randomized Automatic Differentiation" ☆67 Updated 5 years ago
- ☆133 Updated 7 years ago
- A Python implementation of the gradient REBAR estimator. ☆46 Updated 7 years ago
- ☆30 Updated 4 years ago
- Experiments for the paper "Exponential expressivity in deep neural networks through transient chaos" ☆72 Updated 9 years ago
- Code for "A Spectral Approach to Gradient Estimation for Implicit Distributions" (ICML'18) ☆33 Updated 2 years ago
- Convolutional Neural Tangent Kernel ☆113 Updated 5 years ago
- PyTorch-SSO: Scalable Second-Order methods in PyTorch ☆147 Updated last year
- paper lists and information on mean-field theory of deep learning ☆78 Updated 6 years ago
- Autoregressive Energy Machines ☆78 Updated 2 years ago
- Hypergradient descent ☆149 Updated last year
- Experiments for Meta-Learning Symmetries by Reparameterization ☆57 Updated 4 years ago
- Hessian spectral density estimation in TF and Jax ☆124 Updated 5 years ago
- Code release for the ICLR paper ☆21 Updated 7 years ago
- Lua implementation of Entropy-SGD ☆82 Updated 7 years ago
- A tensorflow implementation of the NIPS 2018 paper "Variational Inference with Tail-adaptive f-Divergence" ☆21 Updated 6 years ago
- ☆170 Updated last year