JegZheng / MS-MLPLinks
Pytorch implementation of Mix-Shifting-MLP (MS-MLP)
☆16Updated 3 years ago
Alternatives and similar repositories for MS-MLP
Users that are interested in MS-MLP are comparing it to the libraries listed below
Sorting:
- [ECCV 2022] AMixer: Adaptive Weight Mixing for Self-attention Free Vision Transformers☆28Updated 3 years ago
- MIMIC: Masked Image Modeling with Image Correspondences☆16Updated last year
- ☆25Updated 4 years ago
- [WACV2025 Oral] DeepMIM: Deep Supervision for Masked Image Modeling☆55Updated 7 months ago
- A Close Look at Spatial Modeling: From Attention to Convolution☆91Updated 2 years ago
- [ICLR 2023 Spotlight] GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation☆102Updated 2 years ago
- i-mae Pytorch Repo☆20Updated last year
- A Simple Adaptive Unfolding Network for Hyperspectral Image Reconstruction☆32Updated 2 years ago
- PyTorch Implementation of "Your ViT is Secretly a Hybrid Discriminative-Generative Diffusion Model"☆49Updated 3 years ago
- Log-Polar Space Convolution for Convolutional Neural Networks☆13Updated 2 years ago
- SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modality☆35Updated last year
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆28Updated last year
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆37Updated 2 years ago
- open source the research work for published on arxiv. https://arxiv.org/abs/2106.02689☆53Updated 3 years ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆87Updated 8 months ago
- ☆31Updated 2 years ago
- This is the code for CVPR2022 paper "Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation"☆19Updated 2 years ago
- Pytorch implementation of our paper accepted by ECCV2022 -- Knowledge Condensation Distillation https://arxiv.org/abs/2207.05409☆30Updated 2 years ago
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆79Updated 2 years ago
- ☆32Updated last year
- Code for Learning to Zoom and Unzoom (CVPR 2023)☆46Updated 2 years ago
- ☆62Updated 2 years ago
- [TIP] Exploring Effective Factors for Improving Visual In-Context Learning☆19Updated 5 months ago
- Code for You Only Cut Once: Boosting Data Augmentation with a Single Cut, ICML 2022.☆105Updated 2 years ago
- code base for vision transformers☆36Updated 4 years ago
- 🔥 🔥 [WACV2024] Mini but Mighty: Finetuning ViTs with Mini Adapters☆19Updated last year
- [CVPR2022 - Oral] Official Jax Implementation of Learned Queries for Efficient Local Attention☆118Updated 3 years ago
- ☆16Updated 3 years ago
- ☆72Updated 9 months ago
- How Much Position Information Do Convolutional Neural Networks Encode?☆11Updated 4 years ago