OPTML-Group / Robust-MoE-CNN
[ICCV23] Robust Mixture-of-Expert Training for Convolutional Neural Networks by Yihua Zhang, Ruisi Cai, Tianlong Chen, Guanhua Zhang, Huan Zhang, Pin-Yu Chen, Shiyu Chang, Zhangyang (Atlas) Wang, Sijia Liu
☆55 · Updated last year
Alternatives and similar repositories for Robust-MoE-CNN:
Users who are interested in Robust-MoE-CNN are comparing it to the libraries listed below:
- PyTorch code and checkpoints release for OFA-KD: https://arxiv.org/abs/2310.19444 ☆122 · Updated last year
- ☆86 · Updated 2 years ago
- Code for 'Multi-level Logit Distillation' (CVPR2023) ☆63 · Updated 7 months ago
- [CVPR 2023] This repository includes the official implementation of our paper "Masked Autoencoders Enable Efficient Knowledge Distillers" ☆106 · Updated last year
- [ICCV 2023 & AAAI 2023] Binary Adapters & FacT, [Tech report] Convpass ☆186 · Updated last year
- This is the repository for paper: Gradient-based Parameter Selection for Efficient Fine-Tuning ☆22 · Updated last month
- The official implementation for MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task Learning (CVPR '24) ☆48 · Updated 2 months ago
- [CVPR-2024] Official implementations of CLIP-KD: An Empirical Study of CLIP Model Distillation ☆112 · Updated 10 months ago
- Official implementation for paper "Knowledge Diffusion for Distillation", NeurIPS 2023 ☆83 · Updated last year
- Implementation of AAAI 2022 Paper: Go wider instead of deeper ☆32 · Updated 2 years ago
- [ICCV 2023 oral] This is the official repository for our paper: "Sensitivity-Aware Visual Parameter-Efficient Fine-Tuning". ☆70 · Updated last year
- (NeurIPS 2023 spotlight) Large-scale Dataset Distillation/Condensation, 50 IPC (Images Per Class) achieves the highest 60.8% on original … ☆128 · Updated 5 months ago
- ImageNet-1K data download, processing for using as a dataset ☆95 · Updated 2 years ago
- ICLR 2024, Towards Lossless Dataset Distillation via Difficulty-Aligned Trajectory Matching ☆102 · Updated 11 months ago
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions ☆60 · Updated last year
- Awesome-Low-Rank-Adaptation ☆95 · Updated 6 months ago
- [CVPR 2024] On the Diversity and Realism of Distilled Dataset: An Efficient Dataset Distillation Paradigm ☆69 · Updated 2 months ago
- Official code for Scale Decoupled Distillation ☆41 · Updated last year
- Official PyTorch (MMCV) implementation of “Adversarial AutoMixup” (ICLR 2024 spotlight) ☆67 · Updated 6 months ago
- [CVPR-22] This is the official implementation of the paper "Adavit: Adaptive vision transformers for efficient image recognition". ☆52 · Updated 2 years ago
- Official implementation of paper "Knowledge Distillation from A Stronger Teacher", NeurIPS 2022 ☆146 · Updated 2 years ago
- Official code for ICLR 2024 paper, "A Hard-to-Beat Baseline for Training-free CLIP-based Adaptation" ☆78 · Updated last year
- Official Pytorch implementation of "E2VPT: An Effective and Efficient Approach for Visual Prompt Tuning". (ICCV2023) ☆68 · Updated last year
- Convolutional Initialization for Data-Efficient Vision Transformers ☆14 · Updated last year
- ☆36 · Updated 9 months ago
- The official implementation of "2024NeurIPS Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation" ☆45 · Updated 4 months ago
- [CVPR 2023 Highlight] This is the official implementation of "Stitchable Neural Networks". ☆248 · Updated 2 years ago
- [NeurIPS2023] Parameter-efficient Tuning of Large-scale Multimodal Foundation Model ☆86 · Updated last year
- Transformers trained on Tiny ImageNet ☆54 · Updated 2 years ago
- The official implementation for paper: Improving Knowledge Distillation via Regularizing Feature Norm and Direction ☆18 · Updated last year