gd-zhang / Weight-Decay
Regularization, Neural Network Training Dynamics
☆14 · Updated 5 years ago
Alternatives and similar repositories for Weight-Decay:
Users who are interested in Weight-Decay are comparing it to the repositories listed below.
- TensorFlow implementation of "noisy K-FAC" and "noisy EK-FAC". ☆60 · Updated 6 years ago
- Natural Gradient, Variational Inference ☆29 · Updated 5 years ago
- Limitations of the Empirical Fisher Approximation ☆47 · Updated 3 weeks ago
- Code for the Thermodynamic Variational Objective ☆26 · Updated 2 years ago
- Code for "Accelerating Natural Gradient with Higher-Order Invariance" ☆30 · Updated 5 years ago
- Monotone operator equilibrium networks ☆51 · Updated 4 years ago
- ☆36 · Updated 3 years ago
- Code for "A Spectral Approach to Gradient Estimation for Implicit Distributions" (ICML'18) ☆33 · Updated last year
- ☆80 · Updated 3 years ago
- Estimating Gradients for Discrete Random Variables by Sampling without Replacement ☆40 · Updated 5 years ago
- ☆29 · Updated 4 years ago
- Experiments for the paper "Exponential expressivity in deep neural networks through transient chaos" ☆70 · Updated 8 years ago
- Code accompanying VarGrad: A Low-Variance Gradient Estimator for Variational Inference ☆12 · Updated 4 years ago
- Code release for the ICLR paper ☆20 · Updated 6 years ago
- Autoregressive Energy Machines ☆77 · Updated 2 years ago
- A public repository for our paper, Rao-Blackwellized Stochastic Gradients for Discrete Distributions ☆22 · Updated 5 years ago
- Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020. ☆74 · Updated 8 months ago
- Computing the eigenvalues of Neural Tangent Kernel and Conjugate Kernel (aka NNGP kernel) over the boolean cube ☆48 · Updated 5 years ago
- Experiments for Meta-Learning Symmetries by Reparameterization ☆56 · Updated 3 years ago
- Code for Self-Tuning Networks (ICLR 2019) https://arxiv.org/abs/1903.03088 ☆53 · Updated 5 years ago
- Approximate Inference Turns Deep Networks into Gaussian Processes (dnn2gp) ☆48 · Updated 5 years ago
- A Chainer extension for K-FAC ☆20 · Updated 5 years ago
- Relative gradient optimization of the Jacobian term in unsupervised deep learning, NeurIPS 2020 ☆21 · Updated 3 years ago
- Large-batch Training, Neural Network Optimization ☆9 · Updated 5 years ago
- This repository is no longer maintained. Check ☆81 · Updated 4 years ago
- ☆31 · Updated 4 years ago
- Reliable Uncertainty Estimates in Deep Neural Networks using Noise Contrastive Priors ☆62 · Updated 4 years ago
- ☆64 · Updated last year
- A Python implementation of the gradient REBAR estimator. ☆46 · Updated 6 years ago
- BIVA: A Very Deep Hierarchy of Latent Variables for Generative Modeling ☆29 · Updated 5 years ago