bellymonster / Weighted-Soft-Label-Distillation
☆57Updated 3 years ago
Alternatives and similar repositories for Weighted-Soft-Label-Distillation
Users interested in Weighted-Soft-Label-Distillation are comparing it to the repositories listed below.
- UniMoCo: Unsupervised, Semi-Supervised and Full-Supervised Visual Representation Learning☆55Updated 3 years ago
- Bag of Instances Aggregation Boosts Self-supervised Distillation (ICLR 2022)☆33Updated 3 years ago
- Self-distillation with Batch Knowledge Ensembling Improves ImageNet Classification☆82Updated 3 years ago
- Beyond Masking: Demystifying Token-Based Pre-Training for Vision Transformers☆26Updated 3 years ago
- Knowledge Transfer via Dense Cross-layer Mutual-distillation (ECCV'2020)☆29Updated 4 years ago
- Code for ViTAS: Vision Transformer Architecture Search☆51Updated 3 years ago
- Official PyTorch implementation of "Co-Mixup: Saliency Guided Joint Mixup with Supermodular Diversity" (ICLR'21 Oral)☆103Updated 3 years ago
- Improving Contrastive Learning by Visualizing Feature Transformation, ICCV 2021 Oral☆90Updated 3 years ago
- PyTorch implementation of the CVPR 2021 paper "SuperMix: Supervising the Mixing Data Augmentation"☆92Updated 3 years ago
- Official code for "Dynamic Token Normalization Improves Vision Transformer", ICLR 2022.☆28Updated 2 years ago
- ☆36Updated 3 years ago
- [ICLR 2021 Spotlight Oral] "Undistillable: Making A Nasty Teacher That CANNOT teach students", Haoyu Ma, Tianlong Chen, Ting-Kuei Hu, Che…☆81Updated 3 years ago
- [NeurIPS'21] "Chasing Sparsity in Vision Transformers: An End-to-End Exploration" by Tianlong Chen, Yu Cheng, Zhe Gan, Lu Yuan, Lei Zhang…☆89Updated last year
- ReSSL: Relational Self-Supervised Learning with Weak Augmentation☆58Updated 3 years ago
- [ICLR 2022]: Fast AdvProp☆35Updated 3 years ago
- Benchmarking Attention Mechanism in Vision Transformers.☆17Updated 2 years ago
- Code and models for the paper Glance-and-Gaze Vision Transformer☆28Updated 3 years ago
- (CVPR 2022) Automated Progressive Learning for Efficient Training of Vision Transformers☆25Updated 2 months ago
- Compressing Representations for Self-Supervised Learning☆78Updated 4 years ago
- [ICML 2021] "Self-Damaging Contrastive Learning", Ziyu Jiang, Tianlong Chen, Bobak Mortazavi, Zhangyang Wang☆63Updated 3 years ago
- Un-Mix: Rethinking Image Mixtures for Unsupervised Visual Representation Learning.☆151Updated 2 years ago
- (NeurIPS 2020 Workshop on SSL) Official Implementation of "MixCo: Mix-up Contrastive Learning for Visual Representation"☆58Updated 2 years ago
- [NeurIPS 2023] Towards Free Data Selection with General-Purpose Models☆35Updated 2 months ago
- Implementation of the Heterogeneous Knowledge Distillation using Information Flow Modeling method☆24Updated 4 years ago
- [ICLR2024] Exploring Target Representations for Masked Autoencoders☆55Updated last year
- Code implementation for paper "On the Efficacy of Small Self-Supervised Contrastive Models without Distillation Signals".☆16Updated 3 years ago
- ☆45Updated 3 years ago
- [ICLR 2022] "As-ViT: Auto-scaling Vision Transformers without Training" by Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wa…☆76Updated 3 years ago
- Batch Normalization with Enhanced Linear Transformation☆53Updated last year
- Official code for "Mean Shift for Self-Supervised Learning"☆57Updated 3 years ago