Kedreamix / MAE-for-CIFAR
MAE for CIFAR,由于可用资源有限,我们仅在 cifar10 上测试模型。我们主要想重现这样的结果:使用 MAE 预训练 ViT 可以比直接使用标签进行监督学习训练获得更好的结果。这应该是自我监督学习比监督学习更有效的数据的证据。
☆58Updated last year
Related projects ⓘ
Alternatives and complementary repositories for MAE-for-CIFAR
- GroupMixAttention and GroupMixFormer☆112Updated 10 months ago
- ☆79Updated last year
- visualization:filter、feature map、attention map、image-mask、grad-cam、human keypoint、guided-backpro☆94Updated last year
- Official implement for ICML2023 paper: "A Closer Look at Self-Supervised Lightweight Vision Transformers"☆110Updated last year
- TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition☆157Updated 11 months ago
- Wavelet Convolutions for Large Receptive Fields. ECCV 2024.☆262Updated this week
- 'NKD and USKD' (ICCV 2023) and 'ViTKD' (CVPRW 2024)☆213Updated last year
- [CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"☆230Updated last month
- CAM', 'ScoreCAM', 'SSCAM', 'ISCAM' 'GradCAM', 'GradCAMpp', 'SmoothGradCAMpp', 'XGradCAM', 'LayerCAM' using by PyTorch.☆68Updated 3 years ago
- The official code of "Rethinking Local Perception in Lightweight Vision Transformer"☆85Updated last year
- [NeurIPS 2022] Implementation of "AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition"☆326Updated 2 years ago
- [CVPR 2024] Code release for TransNeXt model☆424Updated 4 months ago
- ☆111Updated 9 months ago
- PyTorch code and checkpoints release for OFA-KD: https://arxiv.org/abs/2310.19444☆95Updated 6 months ago
- (CVPR2024)RMT: Retentive Networks Meet Vision Transformer☆288Updated 3 months ago
- [CVPR 2024] Rewrite the Stars☆284Updated 6 months ago
- CMT Pytorch implementation of our CVPR 2022 paper CMT: Convolutional Neural Networks Meet Vision Transformers (https://arxiv.org/pdf/2107…☆93Updated 2 years ago
- Codes for ECCV2022 paper - contrastive deep supervision☆68Updated 2 years ago
- 这是一个clip-pytorch的模型,可以训练自己的数据集。☆177Updated last year
- 🕹️SCConv: Spatial and Channel Reconstruction Convolution for Feature Redundancy☆274Updated 2 months ago
- iFormer: Inception Transformer☆242Updated last year
- ☆128Updated 4 months ago
- ☆123Updated 7 months ago
- ☆195Updated 2 months ago
- Official ImageNet Model repository☆216Updated last year
- (CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling☆175Updated 3 months ago
- Cross-modal few-shot adaptation with CLIP☆315Updated 7 months ago
- ReViT - Residual Attention Vision Transformer☆27Updated 8 months ago
- Scattering Vision Transformer☆49Updated 8 months ago
- ☆240Updated 11 months ago