OPTML-Group / Robust-MoE-CNN
[ICCV23] Robust Mixture-of-Expert Training for Convolutional Neural Networks by Yihua Zhang, Ruisi Cai, Tianlong Chen, Guanhua Zhang, Huan Zhang, Pin-Yu Chen, Shiyu Chang, Zhangyang (Atlas) Wang, Sijia Liu
☆61 · Updated 2 years ago
Alternatives and similar repositories for Robust-MoE-CNN
Users interested in Robust-MoE-CNN are comparing it to the repositories listed below.
- PyTorch code and checkpoints release for OFA-KD: https://arxiv.org/abs/2310.19444 ☆132 · Updated last year
- Code for 'Multi-level Logit Distillation' (CVPR 2023) ☆69 · Updated 11 months ago
- ☆91 · Updated 2 years ago
- ImageNet-1K data download and processing for use as a dataset ☆108 · Updated 2 years ago
- Official implementation for paper "Knowledge Diffusion for Distillation", NeurIPS 2023 ☆90 · Updated last year
- [CVPR 2023] This repository includes the official implementation of our paper "Masked Autoencoders Enable Efficient Knowledge Distillers" ☆107 · Updated 2 years ago
- Official PyTorch (MMCV) implementation of "Adversarial AutoMixup" (ICLR 2024 spotlight) ☆69 · Updated 10 months ago
- [ICCV 2023 & AAAI 2023] Binary Adapters & FacT, [Tech report] Convpass ☆192 · Updated 2 years ago
- A collection of plotting tools I wrote for deep learning; if you find them helpful, feel free to give a star. ☆41 · Updated 3 years ago
- (NeurIPS 2023 spotlight) Large-scale Dataset Distillation/Condensation, 50 IPC (Images Per Class) achieves the highest 60.8% on original … ☆129 · Updated 10 months ago
- ☆27 · Updated 2 years ago
- Official implementation of paper "Knowledge Distillation from A Stronger Teacher", NeurIPS 2022 ☆149 · Updated 2 years ago
- The official implementation of paper: "Inter-Instance Similarity Modeling for Contrastive Learning" ☆116 · Updated 10 months ago
- [CVPR 2024] Official implementations of CLIP-KD: An Empirical Study of CLIP Model Distillation ☆125 · Updated 3 weeks ago
- Awesome-Low-Rank-Adaptation ☆115 · Updated 11 months ago
- [CVPR 2022] Official implementation for "Knowledge Distillation with the Reused Teacher Classifier" ☆100 · Updated 3 years ago
- [CVPR 2024] On the Diversity and Realism of Distilled Dataset: An Efficient Dataset Distillation Paradigm ☆74 · Updated 6 months ago
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions ☆61 · Updated last year
- 'NKD and USKD' (ICCV 2023) and 'ViTKD' (CVPRW 2024) ☆237 · Updated last year
- ICLR 2024, Towards Lossless Dataset Distillation via Difficulty-Aligned Trajectory Matching ☆101 · Updated last year
- Official implementation of AAAI 2023 paper "Parameter-efficient Model Adaptation for Vision Transformers" ☆104 · Updated 2 years ago
- The official implementation for MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task Learning (CVPR '24) ☆62 · Updated 2 months ago
- Learning Efficient Vision Transformers via Fine-Grained Manifold Distillation, NeurIPS 2022 ☆32 · Updated 2 years ago
- ☆26 · Updated last year
- This repository maintains a collection of important papers on knowledge distillation (awesome-knowledge-distillation). ☆80 · Updated 5 months ago
- The official repo for CVPR 2023 highlight paper "Gradient Norm Aware Minimization Seeks First-Order Flatness and Improves Generalization" ☆85 · Updated 2 years ago
- The official implementation for paper: Improving Knowledge Distillation via Regularizing Feature Norm and Direction ☆22 · Updated 2 years ago
- [CVPR 2024] Efficient Dataset Distillation via Minimax Diffusion ☆98 · Updated last year
- Official code for Scale Decoupled Distillation ☆41 · Updated last year
- ☆14 · Updated 2 years ago