leo-yangli / l0-arm
Code for L0-ARM: Network Sparsification via Stochastic Binary Optimization
☆15 · Updated 5 years ago
Related projects
Alternatives and complementary repositories for l0-arm
- Towards increasing stability of neural networks for continual learning: https://arxiv.org/abs/2006.06958.pdf (NeurIPS'20) ☆75 · Updated last year
- Compressing Neural Networks using the Variational Information Bottleneck ☆64 · Updated 2 years ago
- [ICLR 2021] "Long Live the Lottery: The Existence of Winning Tickets in Lifelong Learning" by Tianlong Chen*, Zhenyu Zhang*, Sijia Liu, S… ☆23 · Updated 2 years ago
- Implementation of Effective Sparsification of Neural Networks with Global Sparsity Constraint ☆28 · Updated 2 years ago
- Gradient Starvation: A Learning Proclivity in Neural Networks ☆60 · Updated 3 years ago
- An Investigation of Why Overparameterization Exacerbates Spurious Correlations ☆30 · Updated 4 years ago
- A Closer Look at Accuracy vs. Robustness ☆88 · Updated 3 years ago
- Learning To Stop While Learning To Predict ☆33 · Updated 2 years ago
- Code for Sanity-Checking Pruning Methods: Random Tickets can Win the Jackpot ☆42 · Updated 4 years ago
- Code for the paper "Understanding Generalization through Visualizations" ☆60 · Updated 3 years ago
- Code for "Picking Winning Tickets Before Training by Preserving Gradient Flow" https://openreview.net/pdf?id=SkgsACVKPH ☆101 · Updated 4 years ago
- Low-variance, efficient and unbiased gradient estimation for optimizing models with binary latent variables. (ICLR 2019) ☆28 · Updated 5 years ago
- Code accompanying the NeurIPS 2020 paper: WoodFisher (Singh & Alistarh, 2020) ☆46 · Updated 3 years ago
- Code and checkpoints of compressed networks for the paper titled "HYDRA: Pruning Adversarially Robust Neural Networks" (NeurIPS 2020) (ht… ☆90 · Updated last year
- Towards Understanding Sharpness-Aware Minimization [ICML 2022] ☆35 · Updated 2 years ago
- Official code for ICLR 2020 paper "A Neural Dirichlet Process Mixture Model for Task-Free Continual Learning." ☆98 · Updated 4 years ago
- Measurements of Three-Level Hierarchical Structure in the Outliers in the Spectrum of Deepnet Hessians (ICML 2019) ☆17 · Updated 5 years ago
- Low-variance and unbiased gradient for backpropagation through categorical random variables, with application in variational auto-encoder… ☆17 · Updated 4 years ago
- Code for "Just Train Twice: Improving Group Robustness without Training Group Information" ☆68 · Updated 6 months ago
- Code accompanying our paper "Finding trainable sparse networks through Neural Tangent Transfer" to be published at ICML-2020. ☆13 · Updated 4 years ago
- Coresets via Bilevel Optimization ☆65 · Updated 4 years ago
- Codebase for the paper "A Gradient Flow Framework for Analyzing Network Pruning" ☆21 · Updated 3 years ago
- SmoothOut: Smoothing Out Sharp Minima to Improve Generalization in Deep Learning ☆23 · Updated 6 years ago
- [ICML 2021] This is the official github repo for training L_inf dist nets with high certified accuracy. ☆41 · Updated 2 years ago