uuujf / SGDNoise
[ICML 2019] The Anisotropic Noise in Stochastic Gradient Descent: Its Behavior of Escaping from Sharp Minima and Regularization Effects
☆14Updated 5 years ago
Alternatives and similar repositories for SGDNoise
Users who are interested in SGDNoise are comparing it to the libraries listed below.
- PyTorch implementation of neural processes and variants☆29Updated last year
- Implementations of orthogonal and semi-orthogonal convolutions in the Fourier domain with applications to adversarial robustness☆48Updated 4 years ago
- ☆67Updated 6 years ago
- Official implementation of Transformer Neural Processes☆78Updated 3 years ago
- Neural Tangent Kernel Papers☆119Updated 10 months ago
- Visualization of mean field and neural tangent kernel regime☆21Updated last year
- Python3 implementation of the paper [Large-scale optimal transport map estimation using projection pursuit]☆15Updated 4 years ago
- Experiments from the paper "On Second Order Behaviour in Augmented Neural ODEs"☆60Updated last year
- ☆72Updated 11 months ago
- Code for Knowledge-Adaptation Priors based on the NeurIPS 2021 paper by Khan and Swaroop.☆16Updated 3 years ago
- Efficient Riemannian Optimization on Stiefel Manifold via Cayley Transform☆43Updated 6 years ago
- [ICLR 2022] "Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How" by Yuning You, Yue Cao, Tianl…☆14Updated 3 years ago
- Code to accompany paper 'Bayesian Deep Ensembles via the Neural Tangent Kernel'☆26Updated 4 years ago
- ☆28Updated 2 years ago
- ☆13Updated 4 years ago
- Code for Accelerated Linearized Laplace Approximation for Bayesian Deep Learning (ELLA, NeurIPS '22)☆16Updated 3 years ago
- Deep Learning & Information Bottleneck☆62Updated 2 years ago
- Contains code for the NeurIPS 2020 paper by Pan et al., "Continual Deep Learning by Functional Regularisation of Memorable Past"☆44Updated 5 years ago
- Refining continuous-in-depth neural networks☆42Updated 4 years ago
- ☆48Updated 2 years ago
- NeurIPS'23: Energy Discrepancies: A Score-Independent Loss for Energy-Based Models☆17Updated last year
- Code for the paper: "Tensor Programs II: Neural Tangent Kernel for Any Architecture"☆104Updated 5 years ago
- Supporting code for the paper "Dangers of Bayesian Model Averaging under Covariate Shift"☆33Updated 3 years ago
- ☆22Updated 3 years ago
- Towards Understanding Sharpness-Aware Minimization [ICML 2022]☆36Updated 3 years ago
- [ICML 2021] This is the official GitHub repo for training L_inf dist nets with high certified accuracy.☆42Updated 3 years ago
- ☆30Updated 5 years ago
- ☆37Updated 5 years ago
- ☆17Updated 3 years ago
- Example code of Sparse Gaussian Process Attention (ICLR 2023)☆26Updated 2 months ago