OPTML-Group / Robust-MoE-CNN
[ICCV23] Robust Mixture-of-Expert Training for Convolutional Neural Networks by Yihua Zhang, Ruisi Cai, Tianlong Chen, Guanhua Zhang, Huan Zhang, Pin-Yu Chen, Shiyu Chang, Zhangyang (Atlas) Wang, Sijia Liu
☆42 · Updated last year
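The repository above trains mixtures of convolutional experts for adversarial robustness. As orientation only, here is a minimal sketch of a dense-routed mixture-of-experts conv block in PyTorch; all names (`MoEConvBlock`, `num_experts`, the pooled-feature router) are illustrative assumptions, not this repository's actual API or its robust-training recipe.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MoEConvBlock(nn.Module):
    """Sketch: routes each input through a softmax-weighted sum of expert conv paths."""

    def __init__(self, in_ch: int, out_ch: int, num_experts: int = 4):
        super().__init__()
        # Parallel convolutional "experts" over the same input.
        self.experts = nn.ModuleList(
            [nn.Conv2d(in_ch, out_ch, kernel_size=3, padding=1) for _ in range(num_experts)]
        )
        # Router scores each sample from its global-average-pooled features.
        self.router = nn.Linear(in_ch, num_experts)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # (B, C, H, W) -> (B, C) pooled descriptor, then per-sample gate weights (B, E).
        gates = F.softmax(self.router(x.mean(dim=(2, 3))), dim=-1)
        # Stack expert outputs: (B, E, out_ch, H, W).
        expert_outs = torch.stack([e(x) for e in self.experts], dim=1)
        # Weighted combination of expert outputs per sample.
        return (gates[:, :, None, None, None] * expert_outs).sum(dim=1)

if __name__ == "__main__":
    block = MoEConvBlock(3, 16)
    out = block(torch.randn(2, 3, 32, 32))
    print(out.shape)  # torch.Size([2, 16, 32, 32])
```

The dense softmax router here is the simplest choice; the paper's method may use sparse (top-k) routing and pairs the router with adversarial training, which this sketch omits.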
Related projects
Alternatives and complementary repositories for Robust-MoE-CNN
- ☆75 · Updated last year
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning ☆68 · Updated this week
- Implementation of the AAAI 2022 paper "Go Wider Instead of Deeper" ☆32 · Updated 2 years ago
- Awesome-Low-Rank-Adaptation ☆33 · Updated 3 weeks ago
- Code for "Multi-level Logit Distillation" (CVPR 2023) ☆54 · Updated last month
- Learning Efficient Vision Transformers via Fine-Grained Manifold Distillation (NeurIPS 2022) ☆28 · Updated 2 years ago
- The official implementation of "Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation" (NeurIPS 2024) ☆36 · Updated 3 weeks ago
- [ICLR 2024] Towards Lossless Dataset Distillation via Difficulty-Aligned Trajectory Matching ☆94 · Updated 5 months ago
- [NeurIPS 2023 Spotlight] Large-scale Dataset Distillation/Condensation, 50 IPC (Images Per Class) achieves the highest 60.8% on original … ☆119 · Updated this week
- PyTorch code and checkpoints release for OFA-KD: https://arxiv.org/abs/2310.19444 ☆95 · Updated 6 months ago
- [ICCV 2023 & AAAI 2023] Binary Adapters & FacT; [Tech report] Convpass ☆171 · Updated last year
- Official implementation of the paper "Knowledge Diffusion for Distillation" (NeurIPS 2023) ☆76 · Updated 9 months ago
- [NeurIPS 2023] Parameter-efficient Tuning of Large-scale Multimodal Foundation Model ☆82 · Updated 11 months ago
- Fine-tuning Vision Transformers on various classification datasets ☆91 · Updated 2 months ago
- [CVPR 2022] The official implementation of the paper "AdaViT: Adaptive Vision Transformers for Efficient Image Recognition" ☆49 · Updated 2 years ago
- Official code for Scale Decoupled Distillation ☆34 · Updated 7 months ago
- Training ImageNet / CIFAR models with state-of-the-art strategies and techniques such as ViT, KD, Rep, etc. ☆80 · Updated 7 months ago
- [CVPR 2023] The official implementation of our paper "Masked Autoencoders Enable Efficient Knowledge Distillers" ☆99 · Updated last year
- Implementation of HAT: https://arxiv.org/pdf/2204.00993 ☆46 · Updated 7 months ago
- Official implementation of the paper "Knowledge Distillation from A Stronger Teacher" (NeurIPS 2022); the basic KD objective these methods build on is sketched after this list ☆138 · Updated last year
- [ICCV 2023 Oral] The official repository for our paper "Sensitivity-Aware Visual Parameter-Efficient Fine-Tuning" ☆62 · Updated last year
- ☆73 · Updated 4 months ago
- [CVPR 2024] Official implementations of CLIP-KD: An Empirical Study of CLIP Model Distillation ☆72 · Updated 4 months ago
- Official PyTorch implementation of "Which Tokens to Use? Investigating Token Reduction in Vision Transformers", presented at ICCV 2023 NIVT … ☆30 · Updated last year
- [ICCV 23] An approach to enhance the efficiency of Vision Transformers (ViT) by concurrently employing token pruning and token merging tech… ☆87 · Updated last year
- The official implementation of "Adapter is All You Need for Tuning Visual Tasks" ☆71 · Updated 2 months ago
- ImageNet-1K data download and processing for use as a dataset ☆65 · Updated last year
- [ECCV 2022] Implementation of the paper "Locality Guidance for Improving Vision Transformers on Tiny Datasets" ☆76 · Updated 2 years ago
- PyTorch implementation of the NeurIPS 2023 paper "Sparse Parameterization for Epitomic Dataset Distillation" ☆20 · Updated 4 months ago
- The official implementation of MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task Learning ☆32 · Updated 3 months ago
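Many of the entries above are knowledge-distillation methods. For orientation, here is a minimal sketch of the vanilla KD objective (Hinton et al., 2015) that these works extend; the function name and hyperparameter defaults are illustrative, not taken from any of the linked repositories.

```python
import torch
import torch.nn.functional as F

def kd_loss(student_logits, teacher_logits, labels, T: float = 4.0, alpha: float = 0.5):
    """Blend soft teacher-matching with the usual hard-label cross-entropy."""
    # KL divergence between temperature-softened distributions; the T**2
    # factor keeps gradient magnitudes comparable across temperatures.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T ** 2)
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard

# Usage (teacher outputs detached so only the student receives gradients):
#   loss = kd_loss(student(x), teacher(x).detach(), y)
```

The listed papers replace or augment this loss in various ways (multi-level logits, feature diffusion, stronger teachers), but most can be read against this baseline.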