JegZheng / MS-MLPLinks
Pytorch implementation of Mix-Shifting-MLP (MS-MLP)
☆16Updated 3 years ago
Alternatives and similar repositories for MS-MLP
Users that are interested in MS-MLP are comparing it to the libraries listed below
Sorting:
- [ECCV 2022] AMixer: Adaptive Weight Mixing for Self-attention Free Vision Transformers☆29Updated 3 years ago
- ☆25Updated 4 years ago
- MIMIC: Masked Image Modeling with Image Correspondences☆16Updated last year
- open source the research work for published on arxiv. https://arxiv.org/abs/2106.02689☆53Updated 3 years ago
- [ICLR 2023 Spotlight] GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation☆102Updated 2 years ago
- Code for Learning to Zoom and Unzoom (CVPR 2023)☆47Updated 2 years ago
- [WACV2025 Oral] DeepMIM: Deep Supervision for Masked Image Modeling☆55Updated 7 months ago
- A Close Look at Spatial Modeling: From Attention to Convolution☆92Updated 3 years ago
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆28Updated last year
- ☆31Updated 2 years ago
- SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modality☆35Updated last year
- The official implementation of ADDP (ICLR 2024)☆12Updated last year
- A Simple Adaptive Unfolding Network for Hyperspectral Image Reconstruction☆32Updated 2 years ago
- Log-Polar Space Convolution for Convolutional Neural Networks☆13Updated 3 years ago
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆80Updated 2 years ago
- PyTorch Implementation of "Your ViT is Secretly a Hybrid Discriminative-Generative Diffusion Model"☆49Updated 3 years ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆87Updated 8 months ago
- ☆62Updated 2 years ago
- Learning 1D Causal Visual Representation with De-focus Attention Networks☆35Updated last year
- i-mae Pytorch Repo☆20Updated last year
- How Much Position Information Do Convolutional Neural Networks Encode?☆11Updated 4 years ago
- code base for vision transformers☆36Updated 4 years ago
- [CVPR2022 - Oral] Official Jax Implementation of Learned Queries for Efficient Local Attention☆118Updated 3 years ago
- This is the code for CVPR2022 paper "Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation"☆19Updated 2 years ago
- [ICLR 2022] "Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice" by Peihao Wang, Wen…☆82Updated last year
- Original PyTorch implementation of the paper "Semantic Segmentation under Adverse Conditions: A Weather and Nighttime-aware Synthetic Dat…☆28Updated last year
- DALL-E for Detection: Language-driven Compositional Image Synthesis for Object Detection☆21Updated 2 years ago
- FFNet: MetaMixer-based Efficient Convolutional Mixer Design☆31Updated 9 months ago
- Official codes for ConMIM (ICLR 2023)☆58Updated 2 years ago
- ☆27Updated 3 years ago