lliai / DisWOT-CVPR2023
☆28 · Updated 2 years ago
Alternatives and similar repositories for DisWOT-CVPR2023
Users that are interested in DisWOT-CVPR2023 are comparing it to the libraries listed below
- ☆28 · Updated 3 years ago
- Code for 'Multi-level Logit Distillation' (CVPR2023) ☆71 · Updated last year
- ☆48 · Updated 2 years ago
- [NeurIPS 2024] Search for Efficient LLMs ☆16 · Updated last year
- The official project website of "NORM: Knowledge Distillation via N-to-One Representation Matching" (The paper of NORM is published in IC… ☆20 · Updated 2 years ago
- Training ImageNet / CIFAR models with sota strategies and fancy techniques such as ViT, KD, Rep, etc. ☆87 · Updated last year
- [ICLR'23] Trainability Preserving Neural Pruning (PyTorch) ☆34 · Updated 2 years ago
- Learning Efficient Vision Transformers via Fine-Grained Manifold Distillation. NeurIPS 2022. ☆33 · Updated 3 years ago
- [Preprint] Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Prunin… ☆41 · Updated 4 months ago
- PyTorch code and checkpoints release for VanillaKD: https://arxiv.org/abs/2305.15781 ☆76 · Updated 2 years ago
- TF-FD ☆20 · Updated 3 years ago
- Official implement of Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer ☆74 · Updated 3 years ago
- [ECCV-2022] Official implementation of MixSKD: Self-Knowledge Distillation from Mixup for Image Recognition && Pytorch Implementations of… ☆110 · Updated 3 years ago
- Switchable Online Knowledge Distillation ☆19 · Updated last year
- Implementation of PGONAS for CVPR22W and RD-NAS for ICASSP23 ☆23 · Updated 2 years ago
- [ICCV 23] An approach to enhance the efficiency of Vision Transformer (ViT) by concurrently employing token pruning and token merging tech… ☆104 · Updated 2 years ago
- ☆13 · Updated 2 years ago
- To appear in the 11th International Conference on Learning Representations (ICLR 2023). ☆18 · Updated 2 years ago
- Official implementation for "Knowledge Distillation with Refined Logits". ☆21 · Updated last year
- [ICML2024] DetKDS: Knowledge Distillation Search for Object Detectors ☆19 · Updated last year
- [CVPR-2022] Official implementation for "Knowledge Distillation with the Reused Teacher Classifier". ☆100 · Updated 3 years ago
- BESA is a differentiable weight pruning technique for large language models. ☆17 · Updated last year
- ☆23 · Updated last year
- Pytorch implementation of our paper accepted by IEEE TNNLS, 2022: Carrying out CNN Channel Pruning in a White Box ☆18 · Updated 3 years ago
- Source code of our TNNLS paper "Boosting Convolutional Neural Networks with Middle Spectrum Grouped Convolution" ☆12 · Updated 2 years ago
- ☆20 · Updated 3 years ago
- [NeurIPS 2022] Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach -- Official Implementation ☆47 · Updated 2 years ago
- Official implementation of the paper "Function-Consistent Feature Distillation" (ICLR 2023) ☆30 · Updated 2 years ago
- Official implementation of paper "Knowledge Distillation from A Stronger Teacher", NeurIPS 2022 ☆154 · Updated 3 years ago
- ☆24 · Updated 2 years ago