kunglab / branchynet
☆124Updated last year
Alternatives and similar repositories for branchynet:
Users that are interested in branchynet are comparing it to the libraries listed below
- ☆116Updated 6 years ago
- Pytorch-based early exit network inspired by branchynet☆31Updated last week
- 基于提前退出部分样本原理而实现的带分支网络(supported by chainer)☆43Updated 5 years ago
- Partial implementation of paper "DEEP GRADIENT COMPRESSION: REDUCING THE COMMUNICATION BANDWIDTH FOR DISTRIBUTED TRAINING"☆31Updated 4 years ago
- FedNAS: Federated Deep Learning via Neural Architecture Search☆53Updated 3 years ago
- Code for "Adaptive Gradient Quantization for Data-Parallel SGD", published in NeurIPS 2020.☆30Updated 4 years ago
- ☆213Updated 6 years ago
- Code for the signSGD paper☆83Updated 4 years ago
- Mayo: Auto-generation of hardware-friendly deep neural networks. Dynamic Channel Pruning: Feature Boosting and Suppression.☆114Updated 5 years ago
- vector quantization for stochastic gradient descent.☆33Updated 4 years ago
- [ICLR 2018] Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training☆219Updated 7 months ago
- [CVPR 2019, Oral] HAQ: Hardware-Aware Automated Quantization with Mixed Precision☆379Updated 4 years ago
- Measuring and predicting on-device metrics (latency, power, etc.) of machine learning models☆66Updated last year
- Prune DNN using Alternating Direction Method of Multipliers (ADMM)☆108Updated 4 years ago
- LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks☆242Updated 2 years ago
- Code example for the ICLR 2018 oral paper☆151Updated 6 years ago
- About DNN compression and acceleration on Edge Devices.☆56Updated 3 years ago
- Reducing the size of convolutional neural networks☆114Updated 7 years ago
- Code for SkipNet: Learning Dynamic Routing in Convolutional Networks (ECCV 2018)☆239Updated 5 years ago
- Pytorch implementation of the paper "SNIP: Single-shot Network Pruning based on Connection Sensitivity" by Lee et al.☆106Updated 5 years ago
- Sparsified SGD with Memory: https://arxiv.org/abs/1809.07599☆59Updated 6 years ago
- Quantization of Convolutional Neural networks.☆243Updated 6 months ago
- 2-stage pruning to favor distributed inference (local device compute half of the model, upload the feature for further computing on stron…☆23Updated 6 years ago
- ☆46Updated 5 years ago
- Learning both Weights and Connections for Efficient Neural Networks https://arxiv.org/abs/1506.02626☆176Updated 2 years ago
- ☆74Updated 5 years ago
- This project will realize experiments about BranchyNet partitioning using pytorch framework☆27Updated 4 years ago
- Deep Compressive Offloading: Speeding Up Neural Network Inference by Trading Edge Computation for Network Latency☆26Updated 4 years ago
- Implements quantized distillation. Code for our paper "Model compression via distillation and quantization"☆332Updated 7 months ago
- ☆48Updated 5 years ago