hushon / JAX-ResNet-CIFAR10Links
Simple CIFAR10 ResNet example with JAX.
☆23Updated 4 years ago
Alternatives and similar repositories for JAX-ResNet-CIFAR10
Users that are interested in JAX-ResNet-CIFAR10 are comparing it to the libraries listed below
Sorting:
- Measurements of Three-Level Hierarchical Structure in the Outliers in the Spectrum of Deepnet Hessians (ICML 2019)☆16Updated 6 years ago
- ☆73Updated last year
- Towards Understanding Sharpness-Aware Minimization [ICML 2022]☆38Updated 3 years ago
- Training vision models with full-batch gradient descent and regularization☆39Updated 2 years ago
- ☆34Updated last year
- ☆16Updated 2 years ago
- A modern look at the relationship between sharpness and generalization [ICML 2023]☆43Updated 2 years ago
- Source code of "What can linearized neural networks actually say about generalization?☆20Updated 4 years ago
- [NeurIPS 2021] A Geometric Analysis of Neural Collapse with Unconstrained Features☆60Updated 3 years ago
- The Full Spectrum of Deepnet Hessians at Scale: Dynamics with SGD Training and Sample Size☆19Updated 6 years ago
- {KFAC,EKFAC,Diagonal,Implicit} Fisher Matrices and finite width NTKs in PyTorch☆217Updated last month
- Pytorch implementation of KFAC and E-KFAC (Natural Gradient).☆132Updated 6 years ago
- ☆23Updated 3 years ago
- Coresets via Bilevel Optimization☆68Updated 5 years ago
- Code for "The Intrinsic Dimension of Images and Its Impact on Learning" - ICLR 2021 Spotlight https://openreview.net/forum?id=XJk19XzGq2J☆72Updated last year
- ☆28Updated 2 years ago
- Neural Tangent Kernel Papers☆121Updated last year
- ☆59Updated 2 years ago
- The Pitfalls of Simplicity Bias in Neural Networks [NeurIPS 2020] (http://arxiv.org/abs/2006.07710v2)☆42Updated last year
- Weight-Averaged Sharpness-Aware Minimization (NeurIPS 2022)☆28Updated 3 years ago
- Pytorch code for "Improving Self-Supervised Learning by Characterizing Idealized Representations"☆41Updated 3 years ago
- Sharpness-Aware Minimization Leads to Low-Rank Features [NeurIPS 2023]☆28Updated 2 years ago
- ☆58Updated 2 years ago
- Code for the paper "Understanding Generalization through Visualizations"☆65Updated 5 years ago
- Train ImageNet *fast* in 500 lines of code with FFCV☆149Updated last year
- An Investigation of Why Overparameterization Exacerbates Spurious Correlations☆30Updated 5 years ago
- Mode Connectivity and Fast Geometric Ensembles in PyTorch☆282Updated 3 years ago
- Code to implement the AND-mask and geometric mean to do gradient based optimization, from the paper "Learning explanations that are hard …☆41Updated 5 years ago
- Lightweight torch implementation of rigl, a sparse-to-sparse optimizer.☆60Updated 4 years ago
- ☆56Updated 5 years ago