ernoult / scalingDTP
"Towards Scaling Difference Target Propagation by Learning Backprop Targets" (ICML 2022)
☆10Updated last year
Related projects ⓘ
Alternatives and complementary repositories for scalingDTP
- Python implementation of the methods in Meulemans et al. 2020 - A Theoretical Framework For Target Propagation☆28Updated 3 weeks ago
- ☆15Updated last year
- [ICLR 2021] "Long Live the Lottery: The Existence of Winning Tickets in Lifelong Learning" by Tianlong Chen*, Zhenyu Zhang*, Sijia Liu, S…☆23Updated 2 years ago
- Codebase for Mechanistic Mode Connectivity☆13Updated last year
- [ECMLPKDD 2020] "Topological Insights into Sparse Neural Networks"☆11Updated 2 years ago
- [ICML 2024] SINGD: KFAC-like Structured Inverse-Free Natural Gradient Descent (http://arxiv.org/abs/2312.05705)☆20Updated 2 weeks ago
- Code for the ICML 2021 and ICLR 2022 papers: Skew Orthogonal Convolutions, Improved deterministic l2 robustness on CIFAR-10 and CIFAR-100☆18Updated 2 years ago
- SGD with large step sizes learns sparse features [ICML 2023]☆32Updated last year
- Prospect Pruning: Finding Trainable Weights at Initialization Using Meta-Gradients☆29Updated 2 years ago
- ☆19Updated 3 years ago
- ☆17Updated 2 years ago
- Deep Learning & Information Bottleneck☆50Updated last year
- Offical Repo for Firefly Neural Architecture Descent: a General Approach for Growing Neural Networks. Accepted by Neurips 2020.☆30Updated 4 years ago
- ☆10Updated 2 years ago
- Lightweight torch implementation of rigl, a sparse-to-sparse optimizer.☆55Updated 3 years ago
- Code accompanying the NeurIPS 2020 paper: WoodFisher (Singh & Alistarh, 2020)☆46Updated 3 years ago
- ☆13Updated 3 years ago
- ☆36Updated 2 years ago
- ☆21Updated last year
- Code for Sanity-Checking Pruning Methods: Random Tickets can Win the Jackpot☆43Updated 4 years ago
- [ICLR2023] NTK-SAP: Improving neural network pruning by aligning training dynamics☆17Updated last year
- ☆21Updated last year
- Updates of Equilibrium Prop Match Gradients of Backprop Through Time in an RNN with Static Input (NeurIPS 2019)☆12Updated 7 months ago
- Code base for SRSGD.☆28Updated 4 years ago
- Public code for Illing, Ventura, Bellec & Gerstner 2021: Local plasticity rules can learn deep representations using self-supervised cont…☆24Updated 7 months ago
- ☆58Updated last year
- Implementation for ACProp ( Momentum centering and asynchronous update for adaptive gradient methdos, NeurIPS 2021)☆15Updated 3 years ago
- PyTorch implementation of Mixer-nano (#parameters is 0.67M, originally Mixer-S/16 has 18M) with 90.83 % acc. on CIFAR-10. Training from s…☆28Updated 3 years ago
- A modern look at the relationship between sharpness and generalization [ICML 2023]☆43Updated last year
- Code for "Can We Scale Transformers to Predict Parameters of Diverse ImageNet Models?" [ICML 2023]☆31Updated 2 months ago