uuujf / SGDNoise
[ICML 2019] The Anisotropic Noise in Stochastic Gradient Descent: Its Behavior of Escaping from Sharp Minima and Regularization Effects
☆13Updated 4 years ago
Related projects: ⓘ
- ☆65Updated 5 years ago
- Efficient Riemannian Optimization on Stiefel Manifold via Cayley Transform☆34Updated 5 years ago
- ☆35Updated this week
- Official code for the ICLR 2021 paper Neural ODE Processes☆71Updated 2 years ago
- ☆56Updated 3 years ago
- Neural Tangent Kernel Papers☆84Updated 6 months ago
- ☆44Updated 10 months ago
- Experiments from the paper "On Second Order Behaviour in Augmented Neural ODEs"☆54Updated last year
- Refining continuous-in-depth neural networks☆39Updated 2 years ago
- Code to accompany paper 'Bayesian Deep Ensembles via the Neural Tangent Kernel'☆26Updated 3 years ago
- ☆10Updated 2 years ago
- Implementations of orthogonal and semi-orthogonal convolutions in the Fourier domain with applications to adversarial robustness☆41Updated 3 years ago
- Code for "The Intrinsic Dimension of Images and Its Impact on Learning" - ICLR 2021 Spotlight https://openreview.net/forum?id=XJk19XzGq2J☆63Updated 5 months ago
- Towards Understanding Sharpness-Aware Minimization [ICML 2022]☆34Updated 2 years ago
- Monotone operator equilibrium networks☆51Updated 4 years ago
- ☆56Updated last year
- Code for Knowledge-Adaptation Priors based on the NeurIPS 2021 paper by Khan and Swaroop.☆16Updated 2 years ago
- ☆25Updated last year
- Code to implement the AND-mask and geometric mean to do gradient based optimization, from the paper "Learning explanations that are hard …☆39Updated 3 years ago
- Pytorch code for "Improving Self-Supervised Learning by Characterizing Idealized Representations"☆40Updated last year
- Contains code for the NeurIPS 2020 paper by Pan et al., "Continual Deep Learning by FunctionalRegularisation of Memorable Past"☆43Updated 3 years ago
- Supporing code for the paper "Bayesian Model Selection, the Marginal Likelihood, and Generalization".☆34Updated 2 years ago
- Visualization of mean field and neural tangent kernel regime☆20Updated last month
- Lipschitz Neural Networks described in "Sorting Out Lipschitz Function Approximation" (ICML 2019).☆54Updated 4 years ago
- Deep Learning & Information Bottleneck☆45Updated last year
- ☆13Updated 2 years ago
- Source code of "What can linearized neural networks actually say about generalization?☆17Updated 2 years ago
- Entropic Optimal Transport Benchmark (NeurIPS 2023).☆18Updated 5 months ago
- ☆31Updated 3 years ago
- Distributional and Outlier Robust Optimization (ICML 2021)☆27Updated 3 years ago