Offical Repo for Firefly Neural Architecture Descent: a General Approach for Growing Neural Networks. Accepted by Neurips 2020.
☆34Oct 26, 2020Updated 5 years ago
Alternatives and similar repositories for Firefly
Users that are interested in Firefly are comparing it to the libraries listed below
Sorting:
- Offical Repo for Splitting Steepest Descent for Growing Neural Architectures☆13May 12, 2021Updated 4 years ago
- Energy-Aware Neural Architecture Optimization with Fast Splitting Steepest Descent☆14Feb 6, 2020Updated 6 years ago
- [ICLR 2021] "Long Live the Lottery: The Existence of Winning Tickets in Lifelong Learning" by Tianlong Chen*, Zhenyu Zhang*, Sijia Liu, S…☆26Dec 30, 2021Updated 4 years ago
- Official PyTorch implementation of CD-MOE☆12Mar 13, 2026Updated last week
- Accepted by AAAI2022☆21Apr 10, 2022Updated 3 years ago
- [ICLR 2021] "Learning a Minimax Optimizer: A Pilot Study" by Jiayi Shen*, Xiaohan Chen*, Howard Heaton*, Tianlong Chen, Jialin Liu, Wotao…☆15Dec 30, 2021Updated 4 years ago
- ☆14Sep 22, 2020Updated 5 years ago
- ImageNet training code that implements academic defaults☆12Jul 15, 2021Updated 4 years ago
- [WACV 2022] "Sandwich Batch Normalization: A Drop-In Replacement for Feature Distribution Heterogeneity" by Xinyu Gong, Wuyang Chen, Tian…☆51Dec 29, 2021Updated 4 years ago
- [ICML 2021] "Do We Actually Need Dense Over-Parameterization? In-Time Over-Parameterization in Sparse Training" by Shiwei Liu, Lu Yin, De…☆45Nov 11, 2023Updated 2 years ago
- ☆24Jun 22, 2022Updated 3 years ago
- This is the time series forecasting models modified by xinze.zh.☆12Mar 10, 2023Updated 3 years ago
- Implementation for <Regularizing Neural Networks via Minimizing Hyperspherical Energy> in CVPR'20.☆24Jun 23, 2020Updated 5 years ago
- Code for reproducing the results in "How Well do Sparse Imagenet Models Transfer?", presented at CVPR 2022☆10Jun 3, 2022Updated 3 years ago
- a jax benchmark for ad hoc teamwork☆20Updated this week
- Official PyTorch Implementation of the Longhorn Deep State Space Model☆57Dec 4, 2024Updated last year
- AutoGrow: Automatic Layer Growing in Deep Convolutional Networks (KDD 2020)☆40Jun 10, 2019Updated 6 years ago
- This is the official code for UGTs.☆13Feb 8, 2023Updated 3 years ago
- ☆215Nov 17, 2022Updated 3 years ago
- Dyna built on R-exprs (First Prototype)☆17Mar 7, 2022Updated 4 years ago
- Crafting Adversarial Examples for Neural Machine Translation☆10Apr 7, 2023Updated 2 years ago
- Official Code Repository for La-MAML: Look-Ahead Meta-Learning for Continual Learning"☆81Dec 16, 2020Updated 5 years ago
- ICLR 2022☆18Apr 15, 2022Updated 3 years ago
- Jax implementation of the AdaHessian optimizer☆20Mar 11, 2021Updated 5 years ago
- [ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, So…☆16Apr 21, 2025Updated 11 months ago
- (CVPR 2022) Automated Progressive Learning for Efficient Training of Vision Transformers☆25Feb 26, 2025Updated last year
- Official Implementation of CVPR2021 paper: Continual Learning via Bit-Level Information Preserving☆39Jan 24, 2023Updated 3 years ago
- Codebase for the paper titled "Continual learning with local module selection"☆25Nov 15, 2021Updated 4 years ago
- [ECCV 2022] SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning☆20Jul 7, 2022Updated 3 years ago
- ☆38Nov 4, 2024Updated last year
- [TPAMI 2020] "Privacy-Preserving Deep Action Recognition: An Adversarial Learning Framework and A New Dataset" by Zhenyu Wu, Haotao Wang,…☆39Dec 30, 2021Updated 4 years ago
- Comparison of method "Pruning at initialization prior to training" (Synflow/SNIP/GraSP) in PyTorch☆17May 12, 2024Updated last year
- Code base for SRSGD.☆28Mar 5, 2020Updated 6 years ago
- Gradually Updated Neural Networks for Large-Scale Image Recognition at ICML 2018☆10Jun 25, 2018Updated 7 years ago
- Implementation of Dual Learning NMT & Joint Training on tensorflow☆12Dec 29, 2018Updated 7 years ago
- [CVPR 2021] "The Lottery Tickets Hypothesis for Supervised and Self-supervised Pre-training in Computer Vision Models" Tianlong Chen, Jon…☆68Dec 17, 2022Updated 3 years ago
- Codebase for " Reducing Representation Drift in Online Continual Learning"☆14Jun 8, 2021Updated 4 years ago
- Mixed integer programming for computing lipschitz constants of ReLU Networks☆17Feb 10, 2023Updated 3 years ago
- Code for the paper "What does BERT know about books, movies and music? Probing BERT for Conversational Recommendation" at RecSys'20☆24Jun 22, 2022Updated 3 years ago