MingSun-Tse / Good-DA-in-KD
[NeurIPS'22] What Makes a "Good" Data Augmentation in Knowledge Distillation -- A Statistical Perspective
☆37 · Updated 2 years ago
Alternatives and similar repositories for Good-DA-in-KD
Users interested in Good-DA-in-KD are comparing it to the repositories listed below.
- [Preprint] Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Pruning ☆40 · Updated 2 years ago
- ☆19 · Updated 2 years ago
- [ICLR'23] Trainability Preserving Neural Pruning (PyTorch) ☆33 · Updated 2 years ago
- This repo is the official MegEngine implementation of the ECCV 2022 paper: Efficient One Pass Self-distillation with Zipf's Label Smoothing ☆26 · Updated 2 years ago
- (CVPR 2022) Automated Progressive Learning for Efficient Training of Vision Transformers ☆25 · Updated 3 months ago
- Code for ViTAS: Vision Transformer Architecture Search ☆50 · Updated 3 years ago
- ☆27 · Updated 2 years ago
- A generic code base for neural network pruning, especially for pruning at initialization. ☆30 · Updated 2 years ago
- ☆57 · Updated 4 years ago
- BESA is a differentiable weight pruning technique for large language models. ☆17 · Updated last year
- PyTorch implementation of the paper "Dataset Distillation via Factorization" (NeurIPS 2022). ☆66 · Updated 2 years ago
- [IJCAI-2021] Contrastive Model Inversion for Data-Free Knowledge Distillation ☆72 · Updated 3 years ago
- Official code and pretrained models for RecursiveMix ☆22 · Updated 2 years ago
- Benchmarking Attention Mechanism in Vision Transformers. ☆18 · Updated 2 years ago
- Bag of Instances Aggregation Boosts Self-supervised Distillation (ICLR 2022) ☆33 · Updated 3 years ago
- ☆31 · Updated 5 years ago
- Official implementation of Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer ☆72 · Updated 2 years ago
- [ICLR 2022] Fast AdvProp ☆35 · Updated 3 years ago
- [NeurIPS'21] "Chasing Sparsity in Vision Transformers: An End-to-End Exploration" by Tianlong Chen, Yu Cheng, Zhe Gan, Lu Yuan, Lei Zhang… ☆90 · Updated last year
- Repo for the paper "Extrapolating from a Single Image to a Thousand Classes using Distillation" ☆36 · Updated 11 months ago
- PyTorch implementation of our paper accepted by ECCV 2022 -- Knowledge Condensation Distillation (https://arxiv.org/abs/2207.05409) ☆30 · Updated 2 years ago
- Official PyTorch implementation of Super Vision Transformer (IJCV) ☆43 · Updated last year
- [ICLR 2022] "Unified Vision Transformer Compression" by Shixing Yu*, Tianlong Chen*, Jiayi Shen, Huan Yuan, Jianchao Tan, Sen Yang, Ji Li… ☆53 · Updated last year
- Official PyTorch implementation for the CVPR 2022 paper "Bootstrapping ViTs: Towards Liberating Vision Transformers from Pre-training" ☆17 · Updated 3 years ago
- Code for the ECCV 2022 paper “Learning with Recoverable Forgetting” ☆21 · Updated 2 years ago
- ☆45 · Updated last year
- Data-free knowledge distillation using Gaussian noise (NeurIPS paper) ☆15 · Updated 2 years ago
- Code for the paper "Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation" (CVPR 2023) ☆39 · Updated 2 years ago
- [ICML 2023] Revisiting Data-Free Knowledge Distillation with Poisoned Teachers ☆22 · Updated 11 months ago
- i-mae PyTorch repo ☆20 · Updated last year