gmum / Zero-Time-Waste
☆13Updated last year
Alternatives and similar repositories for Zero-Time-Waste
Users who are interested in Zero-Time-Waste are comparing it to the repositories listed below.
- [NeurIPS 2021] Sparse Training via Boosting Pruning Plasticity with Neuroregeneration☆31Updated 2 years ago
- Dynamic Dual Gating Neural Networks, ICCV 2021☆32Updated 2 years ago
- [ICLR-2020] Dynamic Sparse Training: Find Efficient Sparse Network From Scratch With Trainable Masked Layers.☆31Updated 5 years ago
- ☆26Updated 2 years ago
- Source Code for ICML 2019 Paper "Shallow-Deep Networks: Understanding and Mitigating Network Overthinking"☆36Updated last year
- Soft Threshold Weight Reparameterization for Learnable Sparsity☆91Updated 2 years ago
- A generic code base for neural network pruning, especially for pruning at initialization.☆31Updated 2 years ago
- [ICML 2021] "Do We Actually Need Dense Over-Parameterization? In-Time Over-Parameterization in Sparse Training" by Shiwei Liu, Lu Yin, De…☆45Updated last year
- Code for our ICLR'2021 paper "DrNAS: Dirichlet Neural Architecture Search"☆43Updated 4 years ago
- ☆9Updated 3 years ago
- How Do Adam and Training Strategies Help BNNs Optimization? In ICML 2021.☆60Updated 4 years ago
- Codebase for the paper "A Gradient Flow Framework for Analyzing Network Pruning"☆21Updated 4 years ago
- PyTorch implementation of the paper "SNIP: Single-shot Network Pruning based on Connection Sensitivity" by Lee et al.☆108Updated 6 years ago
- Conditional channel- and precision-pruning on neural networks☆72Updated 5 years ago
- [NeurIPS 2021] “Stronger NAS with Weaker Predictors”, Junru Wu, Xiyang Dai, Dongdong Chen, Yinpeng Chen, Mengchen Liu, Ye Yu, Zhangyang W…☆27Updated 2 years ago
- [NeurIPS'21] "Chasing Sparsity in Vision Transformers: An End-to-End Exploration" by Tianlong Chen, Yu Cheng, Zhe Gan, Lu Yuan, Lei Zhang…☆89Updated last year
- A PyTorch implementation of scalable neural networks.☆23Updated 5 years ago
- Data-Free Network Quantization With Adversarial Knowledge Distillation PyTorch☆30Updated 3 years ago
- [AAAI-2020] Official implementation for "Online Knowledge Distillation with Diverse Peers".☆74Updated 2 years ago
- Official PyTorch Implementation of HELP: Hardware-adaptive Efficient Latency Prediction for NAS via Meta-Learning (NeurIPS 2021 Spotlight…☆63Updated last year
- Code for our ICML'2020 paper "Stabilizing Differentiable Architecture Search via Perturbation-based Regularization"☆76Updated 4 years ago
- Code for "AttentiveNAS: Improving Neural Architecture Search via Attentive Sampling"☆104Updated 3 years ago
- ☆22Updated 5 years ago
- ☆31Updated 3 years ago
- The implementation of AAAI 2021 Paper: "Progressive Network Grafting for Few-Shot Knowledge Distillation".☆33Updated last year
- Codes for paper "Few Shot Network Compression via Cross Distillation", AAAI 2020.☆32Updated 5 years ago
- [ICLR 2021 Spotlight] "CPT: Efficient Deep Neural Network Training via Cyclic Precision" by Yonggan Fu, Han Guo, Meng Li, Xin Yang, Yinin…☆31Updated last year
- Code for ViTAS: Vision Transformer Architecture Search☆50Updated 4 years ago
- [ICML 2022] Training Your Sparse Neural Network Better with Any Mask. Ajay Jaiswal, Haoyu Ma, Tianlong Chen, Ying Ding, and Zhangyang Wang☆28Updated 3 years ago
- Code for our ICLR'2022 paper "Generalizing Few-Shot NAS with Gradient Matching"☆22Updated 2 years ago