SJLeo / FFSDLinks
Pytorch implementation of our paper accepted by IEEE TNNLS, 2022 -- Distilling a Powerful Student Model via Online Knowledge Distillation
☆31Updated 4 years ago
Alternatives and similar repositories for FFSD
Users that are interested in FFSD are comparing it to the libraries listed below
Sorting:
- ☆58Updated 4 years ago
- S2-BNN: Bridging the Gap Between Self-Supervised Real and 1-bit Neural Networks via Guided Distribution Calibration (CVPR 2021)☆65Updated 4 years ago
- ☆20Updated 2 years ago
- ☆27Updated 3 years ago
- A simple reimplement Online Knowledge Distillation via Collaborative Learning with pytorch☆50Updated 3 years ago
- [AAAI-2020] Official implementation for "Online Knowledge Distillation with Diverse Peers".☆76Updated 2 years ago
- Implementation of the Heterogeneous Knowledge Distillation using Information Flow Modeling method☆25Updated 5 years ago
- Source Code for "Dual-Level Knowledge Distillation via Knowledge Alignment and Correlation", TNNLS, https://ieeexplore.ieee.org/abstract/…☆12Updated 3 years ago
- ☆34Updated 2 years ago
- Feature Fusion for Online Mutual Knowledge Distillation Code☆27Updated 5 years ago
- Codes for paper "Few Shot Network Compression via Cross Distillation", AAAI 2020.☆31Updated 6 years ago
- [CVPR '23] PA&DA: Jointly Sampling PAth and DAta for Consistent NAS☆36Updated 2 years ago
- Knowledge Transfer via Dense Cross-layer Mutual-distillation (ECCV'2020)☆30Updated 5 years ago
- Implementation of PGONAS for CVPR22W and RD-NAS for ICASSP23☆23Updated 2 years ago
- Self-distillation with Batch Knowledge Ensembling Improves ImageNet Classification☆82Updated 4 years ago
- Neuron Merging: Compensating for Pruned Neurons (NeurIPS 2020)☆43Updated 5 years ago
- The implementation of AAAI 2021 Paper: "Progressive Network Grafting for Few-Shot Knowledge Distillation".☆35Updated last year
- Code for Paper "Self-Distillation from the Last Mini-Batch for Consistency Regularization"☆43Updated 3 years ago
- ☆23Updated 6 years ago
- Code for ViTAS_Vision Transformer Architecture Search☆51Updated 4 years ago
- Revisiting Parameter Sharing for Automatic Neural Channel Number Search, NeurIPS 2020☆22Updated 5 years ago
- This is an official implementation of our CVPR 2020 paper "Non-Local Neural Networks With Grouped Bilinear Attentional Transforms".☆12Updated 5 years ago
- [NeurIPS'21] "Chasing Sparsity in Vision Transformers: An End-to-End Exploration" by Tianlong Chen, Yu Cheng, Zhe Gan, Lu Yuan, Lei Zhang…☆89Updated 2 years ago
- ☆13Updated 4 years ago
- (NeurIPS 2019) Deep Model Transferbility from Attribution Maps☆20Updated 6 years ago
- AMTML-KD: Adaptive Multi-teacher Multi-level Knowledge Distillation☆66Updated 4 years ago
- Paper collection about model compression and acceleration: Pruning, Quantization, Knowledge Distillation, Low Rank Factorization, etc☆25Updated 5 years ago
- TF-FD☆20Updated 3 years ago
- IJCAI 2021, "Comparing Kullback-Leibler Divergence and Mean Squared Error Loss in Knowledge Distillation"☆42Updated 3 years ago
- Official code of "NAS acceleration via proxy data", IJCAI21☆10Updated 3 years ago