Offical Repo for Firefly Neural Architecture Descent: a General Approach for Growing Neural Networks. Accepted by Neurips 2020.
☆35Oct 26, 2020Updated 5 years ago
Alternatives and similar repositories for Firefly
Users that are interested in Firefly are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Offical Repo for Splitting Steepest Descent for Growing Neural Architectures☆13May 12, 2021Updated 5 years ago
- Energy-Aware Neural Architecture Optimization with Fast Splitting Steepest Descent☆14Feb 6, 2020Updated 6 years ago
- ☆56Jul 30, 2024Updated last year
- Official pytorch code for "APP: Anytime Progressive Pruning" (DyNN @ ICML, 2022; CLL @ ACML, 2022, SNN @ ICML, 2022 and SlowDNN 2023)☆16Nov 22, 2022Updated 3 years ago
- [ICLR 2021] "Long Live the Lottery: The Existence of Winning Tickets in Lifelong Learning" by Tianlong Chen*, Zhenyu Zhang*, Sijia Liu, S…☆26Dec 30, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Implementations of growing and pruning in neural networks☆22Jul 26, 2023Updated 2 years ago
- Official PyTorch implementation of CD-MOE☆12Mar 18, 2026Updated 2 months ago
- Accepted by AAAI2022☆21Apr 10, 2022Updated 4 years ago
- [ICLR 2021] "Learning a Minimax Optimizer: A Pilot Study" by Jiayi Shen*, Xiaohan Chen*, Howard Heaton*, Tianlong Chen, Jialin Liu, Wotao…☆15Dec 30, 2021Updated 4 years ago
- Code for the EMNLP2020 long paper "Lifelong Language Knowledge Distillation" https://arxiv.org/abs/2010.02123☆12Jul 13, 2021Updated 4 years ago
- This repositorie es the code of the paper Optimizing Reusable Knowledge for Continual Learning via Metalearning.☆11Oct 12, 2021Updated 4 years ago
- ☆14Sep 22, 2020Updated 5 years ago
- [WACV 2022] "Sandwich Batch Normalization: A Drop-In Replacement for Feature Distribution Heterogeneity" by Xinyu Gong, Wuyang Chen, Tian…☆51Dec 29, 2021Updated 4 years ago
- [ICML 2021] "Do We Actually Need Dense Over-Parameterization? In-Time Over-Parameterization in Sparse Training" by Shiwei Liu, Lu Yin, De…☆45Nov 11, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆24Jun 22, 2022Updated 3 years ago
- Implementation for <Regularizing Neural Networks via Minimizing Hyperspherical Energy> in CVPR'20.☆24Jun 23, 2020Updated 5 years ago
- Code for reproducing the results in "How Well do Sparse Imagenet Models Transfer?", presented at CVPR 2022☆10Jun 3, 2022Updated 4 years ago
- ☆12Jul 15, 2020Updated 5 years ago
- HGRN2: Gated Linear RNNs with State Expansion☆57Aug 20, 2024Updated last year
- Correspondence Networks with Adaptive Neighbourhood Consensus☆23Jun 15, 2020Updated 5 years ago
- generative models on toys☆12Sep 10, 2024Updated last year
- NN1 network from FaceNet: A Unified Embedding for Face Recognition and Clustering, in Keras.☆11Jun 13, 2017Updated 8 years ago
- Data sets from the book "Forecasting with exponential smoothing: the state space approach" by Hyndman, Koehler, Ord and Snyder (Springer,…☆15Jan 1, 2026Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆25Apr 5, 2022Updated 4 years ago
- Official PyTorch Implementation of the Longhorn Deep State Space Model☆57Dec 4, 2024Updated last year
- AutoGrow: Automatic Layer Growing in Deep Convolutional Networks (KDD 2020)☆40Jun 10, 2019Updated 7 years ago
- This is the official code for UGTs.☆13Feb 8, 2023Updated 3 years ago
- a jax benchmark for ad hoc teamwork☆21Jun 2, 2026Updated last week
- Official Code Repository for La-MAML: Look-Ahead Meta-Learning for Continual Learning"☆82Dec 16, 2020Updated 5 years ago
- ICLR 2022☆18Apr 15, 2022Updated 4 years ago
- Code for testing DCT plus Sparse (DCTpS) networks☆14Jun 15, 2021Updated 4 years ago
- Jax implementation of the AdaHessian optimizer☆19Mar 11, 2021Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, So…☆16Apr 21, 2025Updated last year
- Official code for ICLR 2020 paper "A Neural Dirichlet Process Mixture Model for Task-Free Continual Learning."☆101Aug 22, 2020Updated 5 years ago
- Code for "Supermasks in Superposition"☆126Oct 3, 2023Updated 2 years ago
- pytorch maml with Multi-GPUs, fast and simplest implementation☆13Dec 4, 2020Updated 5 years ago
- (CVPR 2022) Automated Progressive Learning for Efficient Training of Vision Transformers☆25Feb 26, 2025Updated last year
- The codes for the paper of "A particle swarm optimization-based flexible convolutional auto-encoder for image classification" published b…☆10Jul 21, 2020Updated 5 years ago
- Official Implementation of CVPR2021 paper: Continual Learning via Bit-Level Information Preserving☆39Jan 24, 2023Updated 3 years ago