uuujf / SGDNoise
[ICML 2019] The Anisotropic Noise in Stochastic Gradient Descent: Its Behavior of Escaping from Sharp Minima and Regularization Effects
☆14Updated 5 years ago
Alternatives and similar repositories for SGDNoise
Users who are interested in SGDNoise are comparing it to the libraries listed below
- Implementations of orthogonal and semi-orthogonal convolutions in the Fourier domain with applications to adversarial robustness☆48Updated 4 years ago
- PyTorch implementation of neural processes and variants☆29Updated last year
- Official implementation of Transformer Neural Processes☆78Updated 3 years ago
- ☆73Updated last year
- Sinkhorn Barycenters via Frank-Wolfe algorithm☆27Updated 5 years ago
- Code for Accelerated Linearized Laplace Approximation for Bayesian Deep Learning (ELLA, NeurIPS '22)☆16Updated 3 years ago
- Efficient Riemannian Optimization on Stiefel Manifold via Cayley Transform☆43Updated 6 years ago
- Deep Learning & Information Bottleneck☆63Updated 2 years ago
- ☆68Updated 6 years ago
- Code to implement the AND-mask and geometric mean to do gradient based optimization, from the paper "Learning explanations that are hard …☆41Updated 5 years ago
- Visualization of mean field and neural tangent kernel regime☆21Updated last year
- Experiments from the paper "On Second Order Behaviour in Augmented Neural ODEs"☆61Updated last year
- Neural Tangent Kernel Papers☆120Updated 11 months ago
- PyTorch Implementation of the Nonlinear Information Bottleneck☆41Updated last year
- Code for "The Intrinsic Dimension of Images and Its Impact on Learning" - ICLR 2021 Spotlight https://openreview.net/forum?id=XJk19XzGq2J☆71Updated last year
- Code for the paper: "Tensor Programs II: Neural Tangent Kernel for Any Architecture"☆106Updated 5 years ago
- Code to accompany paper 'Bayesian Deep Ensembles via the Neural Tangent Kernel'☆26Updated 4 years ago
- Code for Knowledge-Adaptation Priors based on the NeurIPS 2021 paper by Khan and Swaroop.☆16Updated 3 years ago
- Python3 implementation of the paper [Large-scale optimal transport map estimation using projection pursuit]☆15Updated 4 years ago
- Contains code for the NeurIPS 2020 paper by Pan et al., "Continual Deep Learning by Functional Regularisation of Memorable Past"☆44Updated 5 years ago
- Towards Understanding Sharpness-Aware Minimization [ICML 2022]☆37Updated 3 years ago
- The Full Spectrum of Deepnet Hessians at Scale: Dynamics with SGD Training and Sample Size☆19Updated 6 years ago
- Refining continuous-in-depth neural networks☆42Updated 4 years ago
- ☆12Updated 2 years ago
- ☆28Updated 2 years ago
- Stochastic Gradient Langevin Dynamics for Bayesian learning☆35Updated 4 years ago
- GFlowNets, MCMC, Metropolis-Hastings, Gibbs sampling, Metropolis-adjusted Langevin, Inverse Transform Sampling, Acceptance-Rejection Metho…☆86Updated 2 years ago
- Official code for the ICLR 2021 paper Neural ODE Processes☆75Updated 3 years ago
- Principled learning method for Wasserstein distributionally robust optimization with local perturbations (ICML 2020)☆21Updated 2 years ago
- NeurIPS'23: Energy Discrepancies: A Score-Independent Loss for Energy-Based Models☆17Updated last year