OSVAI / NORMLinks
The official project website of "NORM: Knowledge Distillation via N-to-One Representation Matching" (The paper of NORM is published in ICLR 2023).
☆20Updated last year
Alternatives and similar repositories for NORM
Users that are interested in NORM are comparing it to the libraries listed below
Sorting:
- [ICML2024] DetKDS: Knowledge Distillation Search for Object Detectors☆15Updated last year
- Official Pytorch implementation of Super Vision Transformer (IJCV)☆43Updated 2 years ago
- PyTorch code and checkpoints release for VanillaKD: https://arxiv.org/abs/2305.15781☆75Updated last year
- ☆26Updated last year
- Code for RepNAS☆13Updated 3 years ago
- S2-BNN: Bridging the Gap Between Self-Supervised Real and 1-bit Neural Networks via Guided Distribution Calibration (CVPR 2021)☆64Updated 4 years ago
- This repo is the official megengine implementation of the ECCV2022 paper: Efficient One Pass Self-distillation with Zipf's Label Smoothin…☆26Updated 2 years ago
- Implementation of PGONAS for CVPR22W and RD-NAS for ICASSP23☆22Updated 2 years ago
- ☆47Updated 2 years ago
- Official implementation for "SimA: Simple Softmax-free Attention for Vision Transformers"☆43Updated last year
- Official implement of Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer☆73Updated 3 years ago
- Official PyTorch implementation of our ECCV 2022 paper "Sliced Recursive Transformer"☆65Updated 2 years ago
- ☆27Updated 2 years ago
- [NeurIPS 2023] Towards Free Data Selection with General-Purpose Models☆40Updated 5 months ago
- Official implementation of paper "Masked Distillation with Receptive Tokens", ICLR 2023.☆71Updated 2 years ago
- ☆19Updated 3 years ago
- TF-FD☆20Updated 2 years ago
- Official implementation of the paper "Function-Consistent Feature Distillation" (ICLR 2023)☆29Updated 2 years ago
- [ACM MM'23] Official implementation of paper "Avatar Knowledge Distillation: Self-ensemble Teacher Paradigm with Uncertainty".☆14Updated last year
- Training ImageNet / CIFAR models with sota strategies and fancy techniques such as ViT, KD, Rep, etc.☆83Updated last year
- [ICCV 2021] FaPN: Feature-aligned Pyramid Network for Dense Image Prediction☆28Updated 3 years ago
- [AAAI 2022] This is the official PyTorch implementation of "Less is More: Pay Less Attention in Vision Transformers"☆97Updated 3 years ago
- [ICCV 2021] Official implementation of "Scalable Vision Transformers with Hierarchical Pooling"☆33Updated 3 years ago
- [ECCV 2022] AMixer: Adaptive Weight Mixing for Self-attention Free Vision Transformers☆28Updated 2 years ago
- [ICLR'23] Trainability Preserving Neural Pruning (PyTorch)☆34Updated 2 years ago
- Official PyTorch implementation of "Meta-prediction Model for Distillation-Aware NAS on Unseen Datasets" (ICLR 2023 notable top 25%)☆24Updated last year
- [NeurIPS 2024] Search for Efficient LLMs☆13Updated 7 months ago
- Pytorch implementation of our paper accepted by ECCV2022 -- Knowledge Condensation Distillation https://arxiv.org/abs/2207.05409☆30Updated 2 years ago
- [Preprint] Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Prunin…☆40Updated 2 years ago
- [ACL'22] Training-free Neural Architecture Search for RNNs and Transformers☆14Updated last year