megvii-research / zipfls
This repo is the official megengine implementation of the ECCV2022 paper: Efficient One Pass Self-distillation with Zipf's Label Smoothing.
☆25Updated last year
Related projects: ⓘ
- Official Codes and Pretrained Models for RecursiveMix☆22Updated last year
- ☆16Updated this week
- Official implementation of the paper "Function-Consistent Feature Distillation" (ICLR 2023)☆25Updated last year
- [NeurIPS'22] What Makes a "Good" Data Augmentation in Knowledge Distillation -- A Statistical Perspective☆35Updated last year
- Official Pytorch implementation for Distilling Image Classifiers in Object detection (NeurIPS2021)☆30Updated 2 years ago
- Official implement of Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer☆69Updated 2 years ago
- Benchmarking Attention Mechanism in Vision Transformers.☆16Updated last year
- Official Pytorch implementation of Super Vision Transformer (IJCV)☆42Updated last year
- Code and models for the paper Glance-and-Gaze Vision Transformer☆28Updated 3 years ago
- [NeurIPS 2023] Towards Free Data Selection with General-Purpose Models☆32Updated 5 months ago
- ☆55Updated 3 years ago
- S2-BNN: Bridging the Gap Between Self-Supervised Real and 1-bit Neural Networks via Guided Distribution Calibration (CVPR 2021)☆63Updated 3 years ago
- Pytorch implementation of our paper accepted by IEEE TNNLS, 2021 -- Network Pruning using Adaptive Exemplar Filters☆21Updated 3 years ago
- Official implementation for "SimA: Simple Softmax-free Attention for Vision Transformers"☆34Updated 5 months ago
- [ICCV 2021] Official implementation of "Scalable Vision Transformers with Hierarchical Pooling"☆30Updated 2 years ago
- Official implementation for paper "DyRep: Bootstrapping Training with Dynamic Re-parameterization", CVPR 2022☆42Updated 2 years ago
- Deep Structured Instance Graph for Distilling Object Detectors (ICCV 2021)☆35Updated 2 years ago
- ☆9Updated 2 years ago
- Official PyTorch implementation of our ECCV 2022 paper "Sliced Recursive Transformer"☆62Updated 2 years ago
- [ICLR 2022]: Fast AdvProp☆35Updated 2 years ago
- An implementation of <Group Fisher Pruning for Practical Network Compression> based on pytorch and mmcv☆17Updated 2 years ago
- Knowledge Transfer via Dense Cross-layer Mutual-distillation (ECCV'2020)☆30Updated 4 years ago
- The official project website of "NORM: Knowledge Distillation via N-to-One Representation Matching" (The paper of NORM is published in IC…☆19Updated last year
- ☆16Updated 2 years ago
- Switchable Online Knowledge Distillation☆16Updated last year
- Teach-DETR: Better Training DETR with Teachers☆28Updated 6 months ago
- Lightweight Transformer for Multi-modal Tasks☆15Updated last year
- ☆26Updated last year
- [ECCV 2022] AMixer: Adaptive Weight Mixing for Self-attention Free Vision Transformers☆27Updated last year
- Official PyTorch implementation of Data-free Knowledge Distillation for Object Detection, WACV 2021.☆61Updated 2 years ago