uuujf / SGDNoiseLinks
[ICML 2019] The Anisotropic Noise in Stochastic Gradient Descent: Its Behavior of Escaping from Sharp Minima and Regularization Effects
☆14Updated 5 years ago
Alternatives and similar repositories for SGDNoise
Users that are interested in SGDNoise are comparing it to the libraries listed below
Sorting:
- Neural Tangent Kernel Papers☆118Updated 9 months ago
- ☆67Updated 6 years ago
- ☆71Updated 10 months ago
- Visualization of mean field and neural tangent kernel regime☆20Updated last year
- ☆48Updated 2 years ago
- Pytorch implementation of neural processes and variants☆29Updated last year
- Official implementation of Transformer Neural Processes☆78Updated 3 years ago
- Experiments from the paper "On Second Order Behaviour in Augmented Neural ODEs"☆60Updated last year
- Refining continuous-in-depth neural networks☆42Updated 3 years ago
- Code for the paper: "Tensor Programs II: Neural Tangent Kernel for Any Architecture"☆104Updated 5 years ago
- [ICML'21] Improved Contrastive Divergence Training of Energy Based Models☆66Updated 3 years ago
- Implementations of orthogonal and semi-orthogonal convolutions in the Fourier domain with applications to adversarial robustness☆47Updated 4 years ago
- Sinkhorn Barycenters via Frank-Wolfe algorithm☆27Updated 5 years ago
- [ICLR 2022] "Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How" by Yuning You, Yue Cao, Tianl…☆14Updated 3 years ago
- Code for Accelerated Linearized Laplace Approximation for Bayesian Deep Learning (ELLA, NeurIPS 22')☆16Updated 3 years ago
- GflowNets, MCMC, Metropolis-Hasting, Gibbs sampling, Metropolis-adjusted Langevin, Inverse Transform Sampling, Acceptance-Rejection Metho…☆86Updated 2 years ago
- ☆28Updated 2 years ago
- Official code for the ICLR 2021 paper Neural ODE Processes☆75Updated 3 years ago
- Deep Learning & Information Bottleneck☆61Updated 2 years ago
- Code to accompany paper 'Bayesian Deep Ensembles via the Neural Tangent Kernel'☆26Updated 4 years ago
- The Full Spectrum of Deepnet Hessians at Scale: Dynamics with SGD Training and Sample Size☆17Updated 6 years ago
- ☆14Updated 4 years ago
- ☆24Updated 3 years ago
- A set of tests for evaluating large-scale algorithms for Wasserstein-1 transport computation (NeurIPS'22).☆21Updated last year
- ☆54Updated last year
- PyTorch implementation of the paper "Continuous Wasserstein-2 Barycenter Estimation without Minimax Optimization" (ICLR 2021)☆24Updated 3 years ago
- Laplace Redux -- Effortless Bayesian Deep Learning☆44Updated 4 months ago
- Supplementary code for the paper "Meta-Solver for Neural Ordinary Differential Equations" https://arxiv.org/abs/2103.08561☆25Updated 4 years ago
- Towards Understanding Sharpness-Aware Minimization [ICML 2022]☆35Updated 3 years ago
- Code for Knowledge-Adaptation Priors based on the NeurIPS 2021 paper by Khan and Swaroop.☆16Updated 3 years ago