pprp / Vision-Mamba-CIFAR10Links
☆23Updated last year
Alternatives and similar repositories for Vision-Mamba-CIFAR10
Users that are interested in Vision-Mamba-CIFAR10 are comparing it to the libraries listed below
Sorting:
- ☆28Updated 2 years ago
- Code for 'Multi-level Logit Distillation' (CVPR2023)☆71Updated last year
- ☆28Updated 3 years ago
- Learning Efficient Vision Transformers via Fine-Grained Manifold Distillation. NeurIPS 2022.☆32Updated 3 years ago
- To appear in the 11th International Conference on Learning Representations (ICLR 2023).☆18Updated 2 years ago
- The offical implementation of [NeurIPS2024] Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation https://ar…☆46Updated last year
- Training ImageNet / CIFAR models with sota strategies and fancy techniques such as ViT, KD, Rep, etc.☆87Updated last year
- [ICML2024] DetKDS: Knowledge Distillation Search for Object Detectors☆17Updated last year
- The official project website of "NORM: Knowledge Distillation via N-to-One Representation Matching" (The paper of NORM is published in IC…☆20Updated 2 years ago
- Convolutional Initialization for Data-Efficient Vision Transformers☆16Updated last week
- ResMLP: Feedforward networks for image classification with data-efficient training☆45Updated 4 years ago
- [ECCV 2022] Implementation of the paper "Locality Guidance for Improving Vision Transformers on Tiny Datasets"☆82Updated 3 years ago
- Auto-Prox-AAAI24☆14Updated last year
- [CVPR 2023] Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention During Vision Transformer Inference☆30Updated last year
- ☆48Updated 2 years ago
- (AAAI 2023 Oral) Pytorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer"☆106Updated 2 years ago
- Official implementation for "Knowledge Distillation with Refined Logits".☆21Updated last year
- ☆29Updated last year
- Official code for Scale Decoupled Distillation☆43Updated last year
- PyTorch code and checkpoints release for VanillaKD: https://arxiv.org/abs/2305.15781☆76Updated 2 years ago
- [Preprint] Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Prunin…☆41Updated 3 months ago
- [NeurIPS 2024] Search for Efficient LLMs☆15Updated 11 months ago
- [ECCV-2022] Official implementation of MixSKD: Self-Knowledge Distillation from Mixup for Image Recognition && Pytorch Implementations of…☆110Updated 3 years ago
- Implementation of HAT https://arxiv.org/pdf/2204.00993☆51Updated last year
- Implementation of AAAI 2022 Paper: Go wider instead of deeper☆32Updated 3 years ago
- [ICML 2023] Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN☆28Updated last year
- Official implementation of paper "Knowledge Distillation from A Stronger Teacher", NeurIPS 2022☆154Updated 2 years ago
- ☆63Updated 4 years ago
- Source code of our TNNLS paper "Boosting Convolutional Neural Networks with Middle Spectrum Grouped Convolution"☆12Updated 2 years ago
- Official pytorch implementation of NeurIPS 2022 paper, TokenMixup☆48Updated 3 years ago