briancheung / superposition
☆45Updated 5 years ago
Alternatives and similar repositories for superposition:
Users that are interested in superposition are comparing it to the libraries listed below
- Gradient Starvation: A Learning Proclivity in Neural Networks☆61Updated 4 years ago
- This repository is no longer maintained. Check☆81Updated 4 years ago
- Winning Solution of the NeurIPS 2020 Competition on Predicting Generalization in Deep Learning☆38Updated 3 years ago
- Code for "Supermasks in Superposition"☆121Updated last year
- Code for "Online Learned Continual Compression with Adaptive Quantization Modules"☆27Updated 4 years ago
- A pytorch implementation for the LSTM experiments in the paper: Why Gradient Clipping Accelerates Training: A Theoretical Justification f…☆43Updated 4 years ago
- Growing Dual-Memory Self-Organizing Networks☆25Updated 5 years ago
- Recurrent Back Propagation, Back Propagation Through Optimization, ICML 2018☆41Updated 6 years ago
- ☆34Updated 3 years ago
- Official Code Repository for La-MAML: Look-Ahead Meta-Learning for Continual Learning"☆74Updated 4 years ago
- ☆35Updated last year
- A pytorch compatible data loader to create sequence of tasks for Continual Learning☆33Updated 4 years ago
- Implementation of Methods Proposed in Preventing Gradient Attenuation in Lipschitz Constrained Convolutional Networks (NeurIPS 2019)☆33Updated 4 years ago
- Implementation of Information Dropout☆39Updated 7 years ago
- [CVPR 2021] "The Lottery Tickets Hypothesis for Supervised and Self-supervised Pre-training in Computer Vision Models" Tianlong Chen, Jon…☆68Updated 2 years ago
- [ICLR 2021] "Long Live the Lottery: The Existence of Winning Tickets in Lifelong Learning" by Tianlong Chen*, Zhenyu Zhang*, Sijia Liu, S…☆23Updated 3 years ago
- Bootstrap Your Own Latent (BYOL) pytorch implementation using DistributedDataParallel.☆28Updated 2 years ago
- Net2Net implementation on PyTorch for any possible vision layers.☆38Updated 7 years ago
- Energy-Aware Neural Architecture Optimization with Fast Splitting Steepest Descent☆13Updated 4 years ago
- Code for "Picking Winning Tickets Before Training by Preserving Gradient Flow" https://openreview.net/pdf?id=SkgsACVKPH☆102Updated 4 years ago
- Computing various norms/measures on over-parametrized neural networks☆49Updated 6 years ago
- Overcoming Catastrophic Forgetting by Incremental Moment Matching (IMM)☆34Updated 7 years ago
- Hybrid Discriminative-Generative Training via Contrastive Learning☆75Updated last year
- An adaptive training algorithm for residual network☆15Updated 4 years ago
- ☆44Updated 4 years ago
- ☆34Updated 3 years ago
- Low-variance, efficient and unbiased gradient estimation for optimizing models with binary latent variables. (ICLR 2019)☆28Updated 5 years ago
- Estimating Gradients for Discrete Random Variables by Sampling without Replacement☆39Updated 4 years ago
- ☆40Updated last year
- ☆26Updated 4 years ago