Kennethborup / knowledgeDistillationLinks
PyTorch implementation of (Hinton) knowledge distillation, plus a base class that simplifies implementing other distillation methods.
☆29Updated 4 years ago
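The distillation objective the repo refers to (Hinton et al., 2015) combines a temperature-softened KL term between teacher and student with a standard cross-entropy on hard labels. A minimal pure-Python sketch of that loss follows; the function names and the defaults `T=4.0` and `alpha=0.9` are illustrative choices, not this repo's actual API.

```python
import math

def softmax(logits, T=1.0):
    # Temperature-scaled softmax: higher T yields a softer distribution.
    exps = [math.exp(z / T) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(student_logits, teacher_logits, label, T=4.0, alpha=0.9):
    """Hinton-style KD loss:
    alpha * T^2 * KL(teacher || student) at temperature T
    + (1 - alpha) * cross-entropy with the hard label."""
    p_teacher = softmax(teacher_logits, T)
    p_student = softmax(student_logits, T)
    # KL divergence between the softened distributions; the T^2 factor keeps
    # gradient magnitudes comparable across temperatures (Hinton et al., 2015).
    kl = sum(pt * math.log(pt / ps) for pt, ps in zip(p_teacher, p_student))
    hard_ce = -math.log(softmax(student_logits)[label])
    return alpha * (T ** 2) * kl + (1 - alpha) * hard_ce
```

When teacher and student logits match, the KL term vanishes and only the weighted hard-label cross-entropy remains, which is a quick sanity check for any implementation.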
Alternatives and similar repositories for knowledgeDistillation
Users interested in knowledgeDistillation are comparing it to the libraries listed below.
- PyTorch, PyTorch Lightning framework for trying knowledge distillation in image classification problems☆32Updated last year
- several types of attention modules written in PyTorch for learning purposes☆52Updated last year
- Stochastic Weight Averaging Tutorials using pytorch.☆33Updated 4 years ago
- This is the public github for our paper "Transformer with a Mixture of Gaussian Keys"☆28Updated 3 years ago
- PyTorch implementation of Soft MoE by Google Brain in "From Sparse to Soft Mixtures of Experts" (https://arxiv.org/pdf/2308.00951.pdf)☆78Updated 2 years ago
- A PyTorch implementation of the VGG paper☆35Updated 2 years ago
- A Python Package for Deep Imbalanced Learning☆57Updated 2 months ago
- This repository maintains a collection of important papers on knowledge distillation (awesome-knowledge-distillation).☆80Updated 7 months ago
- ZSKD with PyTorch☆31Updated 2 years ago
- Recycling diverse models☆45Updated 2 years ago
- Demonstration of transfer of knowledge and generalization with distillation☆55Updated 6 years ago
- ☆10Updated 3 years ago
- IJCAI 2021, "Comparing Kullback-Leibler Divergence and Mean Squared Error Loss in Knowledge Distillation"☆42Updated 2 years ago
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch☆39Updated 3 years ago
- ☆95Updated last year
- PyTorch implementation of MoE (mixture of experts)☆49Updated 4 years ago
- A regularized self-labeling approach to improve the generalization and robustness of fine-tuned models☆27Updated 3 years ago
- Skyformer: Remodel Self-Attention with Gaussian Kernel and Nyström Method (NeurIPS 2021)☆63Updated 3 years ago
- AAAI 2021: Robustness of Accuracy Metric and its Inspirations in Learning with Noisy Labels☆23Updated 4 years ago
- [ICLR 2023] “ Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Better Representations”, Ziyu Jian…☆24Updated 2 years ago
- ☆19Updated 4 years ago
- ☆31Updated 5 months ago
- Official code for Group-Transformer (Scale down Transformer by Grouping Features for a Lightweight Character-level Language Model, COLING…☆27Updated 4 years ago
- A project to add scalable state-of-the-art out-of-distribution detection (open set recognition) support by changing two lines of code! Pe…☆79Updated 3 years ago
- AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers☆47Updated 2 years ago
- [ACL 2023] Code for paper “Tailoring Instructions to Student’s Learning Levels Boosts Knowledge Distillation”(https://arxiv.org/abs/2305.…☆38Updated 2 years ago
- Code for paper: “What Data Benefits My Classifier?” Enhancing Model Performance and Interpretability through Influence-Based Data Selecti…☆24Updated last year
- Official Implementation of Unweighted Data Subsampling via Influence Function - AAAI 2020☆64Updated 4 years ago
- Adversarial examples to the new ConvNeXt architecture☆20Updated 3 years ago
- Towards Understanding the Mixture-of-Experts Layer in Deep Learning☆31Updated last year