gd-zhang / noisy-quadratic-modelLinks
Large-batch Training, Neural Network Optimization
☆9Updated 5 years ago
Alternatives and similar repositories for noisy-quadratic-model
Users that are interested in noisy-quadratic-model are comparing it to the libraries listed below
Sorting:
- TensorFlow implementation of "noisy K-FAC" and "noisy EK-FAC".☆60Updated 6 years ago
- Estimating Gradients for Discrete Random Variables by Sampling without Replacement☆40Updated 5 years ago
- Recurrent Back Propagation, Back Propagation Through Optimization, ICML 2018☆41Updated 6 years ago
- ☆26Updated 6 years ago
- Code for Unbiased Implicit Variational Inference (UIVI)☆14Updated 6 years ago
- Code to accompany the paper Radial Bayesian Neural Networks: Beyond Discrete Support In Large-Scale Bayesian Deep Learning☆33Updated 5 years ago
- Public Codebase for Rethinking Parameter Counting: Effective Dimensionality Revisited☆37Updated 2 years ago
- Code for "Exponential Family Estimation via Adversarial Dynamics Embedding" (NeurIPS 2019)☆13Updated 5 years ago
- Code for "Bridging the Gap between f-GANs and Wasserstein GANs", ICML 2020☆14Updated 4 years ago
- ☆25Updated 5 years ago
- Code for "Accelerating Natural Gradient with Higher-Order Invariance"☆30Updated 5 years ago
- Implementation of iterative inference in deep latent variable models☆43Updated 5 years ago
- Code for the Thermodynamic Variational Objective☆26Updated 2 years ago
- ☆49Updated 4 years ago
- Code accompanying VarGrad: A Low-Variance Gradient Estimator for Variational Inference☆12Updated 4 years ago
- Computing various norms/measures on over-parametrized neural networks☆49Updated 6 years ago
- Geometric Certifications of Neural Nets☆41Updated 2 years ago
- ☆12Updated 5 years ago
- Monotone operator equilibrium networks☆52Updated 4 years ago
- Limitations of the Empirical Fisher Approximation☆47Updated 3 months ago
- Pytorch optimizers implementing Hilbert Constrained Gradient Descent☆19Updated 6 years ago
- Echo Noise Channel for Exact Mutual Information Calculation☆17Updated 4 years ago
- Implementation of Information Dropout☆39Updated 7 years ago
- Implementation of Methods Proposed in Preventing Gradient Attenuation in Lipschitz Constrained Convolutional Networks (NeurIPS 2019)☆35Updated 4 years ago
- The Variational Homoencoder: Learning to learn high capacity generative models from few examples☆34Updated last year
- Code from the article: "The Role of Disentanglement in Generalisation" (ICLR, 2021).☆22Updated 3 years ago
- Code for "Training Deep Energy-Based Models with f-Divergence Minimization" ICML 2020☆36Updated 2 years ago
- Code for Augment & Reduce, a scalable stochastic algorithm for large categorical distributions☆10Updated 7 years ago
- Experiments for Meta-Learning Symmetries by Reparameterization☆56Updated 4 years ago
- ☆34Updated 6 years ago