wkcn / TinyViT
[ECCV 2022] TinyViT: Fast Pretraining Distillation for Small Vision Transformers (https://github.com/microsoft/Cream/tree/main/TinyViT)
☆77Updated last year
Alternatives and similar repositories for TinyViT:
Users that are interested in TinyViT are comparing it to the libraries listed below
- [ICCV'23] Official repository of paper SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applicatio…☆279Updated last year
- Code Implementation of EfficientVMamba☆207Updated last year
- Training ImageNet / CIFAR models with sota strategies and fancy techniques such as ViT, KD, Rep, etc.☆82Updated last year
- LSNet: See Large, Focus Small [CVPR 2025]☆84Updated last month
- ImageNet-1K data download, processing for using as a dataset☆95Updated 2 years ago
- 1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundatio…☆221Updated 8 months ago
- [ECCV 2024] Isomorphic Pruning for Vision Models☆68Updated 9 months ago
- The official project website of "KernelWarehouse: Rethinking the Design of Dynamic Convolution" (KW for short, published in ICML 2024)☆100Updated 10 months ago
- An unofficial implementation of MobileNetV4 in Pytorch☆179Updated last month
- Code and models for mobile-former☆123Updated 2 years ago
- [CVPR 2024] SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design☆98Updated 10 months ago
- [CVPR 2024] Rewrite the Stars☆376Updated last year
- [CVPR 2024] PTQ4SAM: Post-Training Quantization for Segment Anything☆71Updated 10 months ago
- ☆246Updated 2 years ago
- InceptionNeXt: When Inception Meets ConvNeXt (CVPR 2024)☆300Updated 5 months ago
- MetaFormer Baselines for Vision (TPAMI 2024)☆457Updated 11 months ago
- One summary of efficient segment anything models☆95Updated 9 months ago
- Official implementation of the CVPR 2024 paper ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense…☆284Updated 3 months ago
- Mamba in Vision: A Comprehensive Survey of Techniques and Applications☆110Updated 7 months ago
- Zero-label image classification via OpenCLIP knowledge distillation☆125Updated last year
- [ICLR 2025 Spotlight] Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures☆456Updated 2 months ago
- Official ImageNet Model repository☆250Updated 2 years ago
- [ECCV 2022] EdgeViT: Competing Light-weight CNNs on Mobile Devices with Vision Transformers☆107Updated 2 years ago
- GroupMixAttention and GroupMixFormer☆116Updated last year
- [ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention☆849Updated last month
- Official repository of paper titled "CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications…☆57Updated 4 months ago
- [CVPR 2025 Oral] OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels☆177Updated this week
- ☆75Updated last year
- [CVPR25] Official implementation of `MobileMamba: Lightweight Multi-Receptive Visual Mamba Network.'☆195Updated last month
- RepViT: Revisiting Mobile CNN From ViT Perspective [CVPR 2024] and RepViT-SAM: Towards Real-Time Segmenting Anything☆941Updated 10 months ago