mansheej / lth_diet
☆10Updated 2 years ago
Alternatives and similar repositories for lth_diet:
Users that are interested in lth_diet are comparing it to the libraries listed below
- ☆35Updated last year
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs☆36Updated 2 years ago
- ☆36Updated 3 years ago
- CIFAR10 ResNets implemented in JAX+Flax☆12Updated 2 years ago
- SGD with large step sizes learns sparse features [ICML 2023]☆32Updated last year
- ☆19Updated 2 years ago
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…☆35Updated 2 years ago
- CIFAR-5m dataset☆38Updated 4 years ago
- ☆17Updated 2 years ago
- ☆65Updated 2 months ago
- Source code of "What can linearized neural networks actually say about generalization?☆20Updated 3 years ago
- A modern look at the relationship between sharpness and generalization [ICML 2023]☆43Updated last year
- ☆11Updated 2 years ago
- A centralized place for deep thinking code and experiments☆82Updated last year
- Simple data balancing baselines for worst-group-accuracy benchmarks.☆41Updated last year
- Code to implement the AND-mask and geometric mean to do gradient based optimization, from the paper "Learning explanations that are hard …☆39Updated 4 years ago
- ModelDiff: A Framework for Comparing Learning Algorithms☆55Updated last year
- Towards Understanding Sharpness-Aware Minimization [ICML 2022]☆35Updated 2 years ago
- Recycling diverse models☆44Updated 2 years ago
- Code for "Training Neural Networks with Fixed Sparse Masks" (NeurIPS 2021).☆58Updated 3 years ago
- ☆60Updated 3 years ago
- Explores the ideas presented in Deep Ensembles: A Loss Landscape Perspective (https://arxiv.org/abs/1912.02757) by Stanislav Fort, Huiyi …☆63Updated 4 years ago
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆29Updated 2 years ago
- Code for "The Intrinsic Dimension of Images and Its Impact on Learning" - ICLR 2021 Spotlight https://openreview.net/forum?id=XJk19XzGq2J☆66Updated 10 months ago
- ☆54Updated 4 years ago
- Implementation of the models and datasets used in "An Information-theoretic Approach to Distribution Shifts"☆25Updated 3 years ago
- Deep Learning & Information Bottleneck☆57Updated last year
- Fast training of unitary deep network layers from low-rank updates☆28Updated 2 years ago
- Public Codebase for Rethinking Parameter Counting: Effective Dimensionality Revisited☆37Updated 2 years ago
- Code accompanying our paper "Feature Learning in Infinite-Width Neural Networks" (https://arxiv.org/abs/2011.14522)☆60Updated 3 years ago