pprp / Vision-Mamba-CIFAR10
☆23Updated last year
Alternatives and similar repositories for Vision-Mamba-CIFAR10
Users who are interested in Vision-Mamba-CIFAR10 are comparing it to the repositories listed below
- ☆28Updated last year
- Code for 'Multi-level Logit Distillation' (CVPR2023)☆70Updated last year
- The official project website of "NORM: Knowledge Distillation via N-to-One Representation Matching" (The paper of NORM is published in IC…☆20Updated 2 years ago
- [ICML2024] DetKDS: Knowledge Distillation Search for Object Detectors☆16Updated last year
- To appear in the 11th International Conference on Learning Representations (ICLR 2023).☆18Updated 2 years ago
- The official implementation of [NeurIPS2024] Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation https://ar…☆44Updated 10 months ago
- (AAAI 2023 Oral) PyTorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer"☆106Updated 2 years ago
- ☆27Updated 2 years ago
- Training ImageNet / CIFAR models with sota strategies and fancy techniques such as ViT, KD, Rep, etc.☆84Updated last year
- Learning Efficient Vision Transformers via Fine-Grained Manifold Distillation. NeurIPS 2022.☆32Updated 3 years ago
- Official implementation for "Knowledge Distillation with Refined Logits".☆16Updated last year
- ☆47Updated 2 years ago
- Convolutional Initialization for Data-Efficient Vision Transformers☆16Updated last year
- Official code for Scale Decoupled Distillation☆41Updated last year
- ResMLP: Feedforward networks for image classification with data-efficient training☆45Updated 4 years ago
- [TPAMI-2023] Official implementations of L-MCL: Online Knowledge Distillation via Mutual Contrastive Learning for Visual Recognition☆26Updated 2 years ago
- Implementation of HAT https://arxiv.org/pdf/2204.00993☆51Updated last year
- [CVPR 2023] Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention During Vision Transformer Inference☆30Updated last year
- [NeurIPS 2024] Search for Efficient LLMs☆15Updated 9 months ago
- [ECCV 2022] Implementation of the paper "Locality Guidance for Improving Vision Transformers on Tiny Datasets"☆82Updated 3 years ago
- This is an official implementation of our NeurIPS 2022 paper "Bridging the Gap Between Vision Transformers and Convolutional Neural Netwo…☆58Updated last month
- [ICCV23] Robust Mixture-of-Expert Training for Convolutional Neural Networks by Yihua Zhang, Ruisi Cai, Tianlong Chen, Guanhua Zhang, Hua…☆65Updated 2 years ago
- PyTorch code and checkpoints release for VanillaKD: https://arxiv.org/abs/2305.15781☆76Updated last year
- [CVPR 2022] "The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy" by Tianlong C…☆25Updated 3 years ago
- ☆63Updated 4 years ago
- Official PyTorch implementation of NeurIPS 2022 paper, TokenMixup☆48Updated 2 years ago
- [ECCV-2022] Official implementation of MixSKD: Self-Knowledge Distillation from Mixup for Image Recognition && PyTorch Implementations of…☆109Updated 2 years ago
- PELA: Learning Parameter-Efficient Models with Low-Rank Approximation [CVPR 2024]☆18Updated last year
- [NeurIPS 2022] “M³ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design”, Hanxue …☆131Updated 2 years ago
- The codebase for paper "PPT: Token Pruning and Pooling for Efficient Vision Transformer"☆27Updated 11 months ago