megvii-research / mdistiller
The official implementation of [CVPR2022] Decoupled Knowledge Distillation https://arxiv.org/abs/2203.08679 and [ICCV2023] DOT: A Distillation-Oriented Trainer https://openaccess.thecvf.com/content/ICCV2023/papers/Zhao_DOT_A_Distillation-Oriented_Trainer_ICCV_2023_paper.pdf
☆830Updated last year
Alternatives and similar repositories for mdistiller:
Users that are interested in mdistiller are comparing it to the libraries listed below
- Pytorch implementation of various Knowledge Distillation (KD) methods.☆1,661Updated 3 years ago
- Distilling Knowledge via Knowledge Review, CVPR 2021☆266Updated 2 years ago
- [ICLR 2020] Contrastive Representation Distillation (CRD), and benchmark of recent knowledge distillation methods☆2,263Updated last year
- Knowledge Distillation: CVPR2020 Oral, Revisiting Knowledge Distillation via Label Smoothing Regularization☆585Updated 2 years ago
- 'NKD and USKD' (ICCV 2023) and 'ViTKD' (CVPRW 2024)☆224Updated last year
- OpenMMLab Model Compression Toolbox and Benchmark.☆1,536Updated 8 months ago
- Awesome Knowledge-Distillation. 分类整理的知识蒸馏paper(2014-2021)。☆2,545Updated last year
- ☆422Updated 2 years ago
- CAIRI Supervised, Semi- and Self-Supervised Visual Representation Learning Toolbox and Benchmark☆637Updated 3 months ago
- [NeurIPS 2021] [T-PAMI] DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification☆593Updated last year
- [AAAI 2023] Official PyTorch Code for "Curriculum Temperature for Knowledge Distillation"☆163Updated 2 months ago
- A coding-free framework built on PyTorch for reproducible deep learning studies. PyTorch Ecosystem. 🏆25 knowledge distillation methods p…☆1,448Updated 2 weeks ago
- A PyTorch implementation for exploring deep and shallow knowledge distillation (KD) experiments with flexibility☆1,907Updated last year
- assistant tools for attention visualization in deep learning☆1,089Updated 2 years ago
- Focal and Global Knowledge Distillation for Detectors (CVPR 2022)☆365Updated 2 years ago
- knowledge distillation papers☆748Updated 2 years ago
- Official implementation of paper "Knowledge Distillation from A Stronger Teacher", NeurIPS 2022☆140Updated 2 years ago
- Masked Generative Distillation (ECCV 2022)☆216Updated 2 years ago
- This is a collection of our NAS and Vision Transformer work.☆1,718Updated 6 months ago
- PyTorch implementation of MoCo v3 https//arxiv.org/abs/2104.02057☆1,242Updated 3 years ago
- The implementation of various lightweight networks by using PyTorch. such as:MobileNetV2,MobileNeXt,GhostNet,ParNet,MobileViT、AdderNet,Sh…☆846Updated 2 years ago
- Efficient computing methods developed by Huawei Noah's Ark Lab☆1,239Updated 3 months ago
- PyTorch code and checkpoints release for OFA-KD: https://arxiv.org/abs/2310.19444☆108Updated 10 months ago
- ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet☆1,176Updated last year
- A codebase and a curated list of awesome deep long-tailed learning (TPAMI 2023).☆529Updated 2 months ago
- Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs (CVPR 2022)☆891Updated 9 months ago
- A quickstart and benchmark for pytorch distributed training.☆1,651Updated 6 months ago
- Official PyTorch implementation of "A Comprehensive Overhaul of Feature Distillation" (ICCV 2019)☆415Updated 4 years ago
- Official code for our ECCV'22 paper "A Fast Knowledge Distillation Framework for Visual Recognition"☆184Updated 9 months ago
- This is a collection of our zero-cost NAS and efficient vision applications.☆401Updated last year