gabrieleilertsen / nws
Dissecting the weight space of neural networks
☆17 · Updated 3 years ago
Related projects
Alternatives and complementary repositories for nws
- ☆34 · Updated last year
- Code for the CVPR 2021 paper: Understanding Failures of Deep Networks via Robust Feature Extraction ☆35 · Updated 2 years ago
- Implementation of the models and datasets used in "An Information-theoretic Approach to Distribution Shifts" ☆25 · Updated 3 years ago
- Active and Sample-Efficient Model Evaluation ☆24 · Updated 3 years ago
- ☆12 · Updated 5 years ago
- ☆36 · Updated 2 years ago
- Winning Solution of the NeurIPS 2020 Competition on Predicting Generalization in Deep Learning ☆39 · Updated 3 years ago
- Code base for SRSGD. ☆28 · Updated 4 years ago
- Pytorch code for "Improving Self-Supervised Learning by Characterizing Idealized Representations" ☆40 · Updated last year
- Reproducible code for Augmentation paper ☆18 · Updated 5 years ago
- ☆19 · Updated 3 years ago
- SGD with large step sizes learns sparse features [ICML 2023] ☆32 · Updated last year
- ☆25 · Updated 4 years ago
- ☆52 · Updated 3 months ago
- [JMLR] TRADES + random smoothing for certifiable robustness ☆14 · Updated 4 years ago
- Code to implement the AND-mask and geometric mean to do gradient based optimization, from the paper "Learning explanations that are hard … ☆39 · Updated 4 years ago
- CIFAR-5m dataset ☆39 · Updated 3 years ago
- Improving Transformation Invariance in Contrastive Representation Learning ☆13 · Updated 3 years ago
- Code accompanying our paper "Finding trainable sparse networks through Neural Tangent Transfer" to be published at ICML-2020. ☆13 · Updated 4 years ago
- [NeurIPS 2020] Coresets for Robust Training of Neural Networks against Noisy Labels ☆33 · Updated 3 years ago
- ☆19 · Updated 2 years ago
- Pytorch implementation for "The Surprising Positive Knowledge Transfer in Continual 3D Object Shape Reconstruction" ☆33 · Updated 2 years ago
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization ☆28 · Updated last year
- Parameter-Space Saliency Maps for Explainability ☆22 · Updated last year
- ☆37 · Updated 3 years ago
- Code for Accelerated Linearized Laplace Approximation for Bayesian Deep Learning (ELLA, NeurIPS 22') ☆16 · Updated 2 years ago
- Label shift experiments ☆15 · Updated 3 years ago
- Code for ICLR 2022 Paper, "Controlling Directions Orthogonal to a Classifier" ☆34 · Updated last year
- The Full Spectrum of Deepnet Hessians at Scale: Dynamics with SGD Training and Sample Size ☆17 · Updated 5 years ago
- MDL Complexity computations and experiments from the paper "Revisiting complexity and the bias-variance tradeoff". ☆18 · Updated last year