gd-zhang / Weight-DecayLinks
Regularization, Neural Network Training Dynamics
☆14Updated 5 years ago
Alternatives and similar repositories for Weight-Decay
Users that are interested in Weight-Decay are comparing it to the libraries listed below
Sorting:
- Natural Gradient, Variational Inference☆29Updated 5 years ago
- TensorFlow implementation of "noisy K-FAC" and "noisy EK-FAC".☆60Updated 6 years ago
- ☆36Updated 3 years ago
- Code for the Thermodynamic Variational Objective☆26Updated 3 years ago
- Repository containing Pytorch code for EKFAC and K-FAC perconditioners.☆143Updated 2 years ago
- Autoregressive Energy Machines☆78Updated 2 years ago
- Limitations of the Empirical Fisher Approximation☆47Updated 4 months ago
- ☆29Updated 4 years ago
- ☆133Updated 7 years ago
- hessian in pytorch☆187Updated 4 years ago
- Pytorch implementation of KFAC and E-KFAC (Natural Gradient).☆132Updated 6 years ago
- Monotone operator equilibrium networks☆53Updated 5 years ago
- Code for "A Spectral Approach to Gradient Estimation for Implicit Distributions" (ICML'18)☆33Updated 2 years ago
- Experiments for the paper "Exponential expressivity in deep neural networks through transient chaos"☆71Updated 9 years ago
- A Python implementation of the gradient REBAR estimator.☆46Updated 7 years ago
- Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020.☆75Updated 11 months ago
- ☆80Updated 3 years ago
- Lagrangian VAE☆28Updated 6 years ago
- A tensorflow implementation of the NIPS 2018 paper "Variational Inference with Tail-adaptive f-Divergence"☆21Updated 6 years ago
- Estimating Gradients for Discrete Random Variables by Sampling without Replacement☆40Updated 5 years ago
- Code release for the ICLR paper☆21Updated 7 years ago
- Optimization with orthogonal constraints and on general manifolds☆129Updated 5 years ago
- ☆26Updated 6 years ago
- Code for Self-Tuning Networks (ICLR 2019) https://arxiv.org/abs/1903.03088☆53Updated 6 years ago
- Code for ICML 2018 paper on "Fast and Scalable Bayesian Deep Learning by Weight-Perturbation in Adam" by Khan, Nielsen, Tangkaratt, Lin, …☆112Updated 6 years ago
- Code for "Accelerating Natural Gradient with Higher-Order Invariance"☆30Updated 6 years ago
- Hypergradient descent☆149Updated last year
- Convolutional Neural Tangent Kernel☆111Updated 5 years ago
- Neural likelihood-free methods in PyTorch.☆39Updated 5 years ago
- PyTorch-SSO: Scalable Second-Order methods in PyTorch☆147Updated last year