a7b23 / Distilling-the-knowledge-in-neural-network
Teaches a student network from the knowledge obtained via training of a larger teacher network
☆156Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for Distilling-the-knowledge-in-neural-network
- TensorFlow Implementation of Deep Mutual Learning☆320Updated 6 years ago
- Knowledge distillation methods implemented with Tensorflow (now there are 11 (+1) methods, and will be added more.)☆266Updated 5 years ago
- PyTorch implementation of "Distilling the Knowledge in a Neural Network" for model compression☆59Updated 7 years ago
- Knowledge Distillation using Tensorflow☆142Updated 5 years ago
- Implementation of model compression with knowledge distilling method.☆346Updated 7 years ago
- Tools for computing model parameters and FLOPs.☆86Updated 5 years ago
- Official Implementation of MEAL: Multi-Model Ensemble via Adversarial Learning on AAAI 2019☆177Updated 4 years ago
- Using Teacher Assistants to Improve Knowledge Distillation: https://arxiv.org/pdf/1902.03393.pdf☆258Updated 5 years ago
- FitNets: Hints for Thin Deep Nets☆204Updated 9 years ago
- Implementation of the mixup training method☆465Updated 6 years ago
- [CVPR 2020] MTL-NAS: Task-Agnostic Neural Architecture Search towards General-Purpose Multi-Task Learning☆92Updated last year
- Learning What and Where to Transfer (ICML 2019)☆250Updated 4 years ago
- Knowledge Transfer via Distillation of Activation Boundaries Formed by Hidden Neurons (AAAI 2019)☆104Updated 5 years ago
- The implementation of “Gradient Harmonized Single-stage Detector” published on AAAI 2019.☆617Updated 5 years ago
- A universal and efficient framework for training well-performing light net☆124Updated 7 years ago
- Implements quantized distillation. Code for our paper "Model compression via distillation and quantization"☆330Updated 3 months ago
- extract features by maximizing mutual information☆147Updated 5 years ago
- ☆231Updated 5 years ago
- A machine learning experiment☆182Updated 7 years ago
- ☆38Updated 6 years ago
- Pytorch implementation of SNAS☆75Updated 5 years ago
- Random miniprojects with pytorch.☆173Updated 6 years ago
- ☆75Updated 5 years ago
- ☆21Updated 2 years ago
- Code for the NuerIPS'19 paper "Gate Decorator: Global Filter Pruning Method for Accelerating Deep Convolutional Neural Networks"☆196Updated 4 years ago
- 3.41% and 17.11% error on CIFAR-10 and CIFAR-100☆328Updated 5 years ago
- Implementation and experiments for AdamW on Pytorch☆93Updated 4 years ago
- ☆165Updated last year
- Multinomial Distribution Learning for Effective Neural Architecture Search☆207Updated 5 years ago
- [ICML 2018] "Deep k-Means: Re-Training and Parameter Sharing with Harder Cluster Assignments for Compressing Deep Convolutions"☆150Updated 2 years ago