gd-zhang / Weight-Decay
Regularization, Neural Network Training Dynamics
☆14Updated 5 years ago
Alternatives and similar repositories for Weight-Decay:
Users that are interested in Weight-Decay are comparing it to the libraries listed below
- Natural Gradient, Variational Inference☆29Updated 5 years ago
- TensorFlow implementation of "noisy K-FAC" and "noisy EK-FAC".☆60Updated 6 years ago
- Code for the Thermodynamic Variational Objective☆26Updated 2 years ago
- Monotone operator equilibrium networks☆51Updated 4 years ago
- Limitations of the Empirical Fisher Approximation☆47Updated 2 months ago
- ☆36Updated 3 years ago
- ☆29Updated 4 years ago
- Estimating Gradients for Discrete Random Variables by Sampling without Replacement☆40Updated 5 years ago
- Code for "A Spectral Approach to Gradient Estimation for Implicit Distributions" (ICML'18)☆33Updated last year
- A tensorflow implementation of the NIPS 2018 paper "Variational Inference with Tail-adaptive f-Divergence"☆21Updated 6 years ago
- Lagrangian VAE☆28Updated 6 years ago
- ☆80Updated 3 years ago
- Implementation of iterative inference in deep latent variable models☆43Updated 5 years ago
- Code for "Accelerating Natural Gradient with Higher-Order Invariance"☆30Updated 5 years ago
- Geometric Certifications of Neural Nets☆41Updated 2 years ago
- Code release for the ICLR paper☆20Updated 6 years ago
- Experiment code for "Randomized Automatic Differentiation"☆67Updated 4 years ago
- Autoregressive Energy Machines☆77Updated 2 years ago
- Code for Self-Tuning Networks (ICLR 2019) https://arxiv.org/abs/1903.03088☆53Updated 5 years ago
- Experiments for the paper "Exponential expressivity in deep neural networks through transient chaos"☆71Updated 8 years ago
- ☆37Updated 5 years ago
- A Python implementation of the gradient REBAR estimator.☆46Updated 6 years ago
- Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020.☆75Updated 9 months ago
- Computing the eigenvalues of Neural Tangent Kernel and Conjugate Kernel (aka NNGP kernel) over the boolean cube☆47Updated 5 years ago
- Convolutional Neural Tangent Kernel☆111Updated 5 years ago
- Reliable Uncertainty Estimates in Deep Neural Networks using Noise Contrastive Priors☆62Updated 5 years ago
- The Full Spectrum of Deepnet Hessians at Scale: Dynamics with SGD Training and Sample Size☆17Updated 5 years ago
- Code for "Efficient optimization of loops and limits with randomized telescoping sums"☆27Updated 5 years ago
- [NeurIPS'19] Deep Equilibrium Models Jax Implementation☆39Updated 4 years ago
- A Chainer extension for K-FAC☆20Updated 5 years ago