chinhsuanwu / mobilevit-pytorch
A PyTorch implementation of "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer"
☆509Updated 3 years ago
Alternatives and similar repositories for mobilevit-pytorch:
Users that are interested in mobilevit-pytorch are comparing it to the libraries listed below
- LeViT a Vision Transformer in ConvNet's Clothing for Faster Inference☆606Updated 2 years ago
- ☆227Updated 2 years ago
- [ECCV 2022] Source code of "EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers"☆349Updated 2 years ago
- Unofficial PyTorch implementation of MobileViT based on paper "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Trans…☆115Updated 2 years ago
- EfficientFormerV2 [ICCV 2023] & EfficientFormer [NeurIPs 2022]☆1,006Updated last year
- PyTorch implementation of EfficientNetV2 family☆465Updated 2 years ago
- Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs (CVPR 2022)☆887Updated 8 months ago
- CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped, CVPR 2022☆555Updated last year
- Diverse Branch Block: Building a Convolution as an Inception-like Unit☆329Updated last year
- ☆332Updated 2 years ago
- A PyTorch implementation of "CoAtNet: Marrying Convolution and Attention for All Data Sizes"☆373Updated 3 years ago
- ☆313Updated 2 years ago
- The implementation of various lightweight networks by using PyTorch. such as:MobileNetV2,MobileNeXt,GhostNet,ParNet,MobileViT、AdderNet,Sh…☆841Updated 2 years ago
- [CADL'22, ECCVW] Official repository of paper titled "EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Ap…☆357Updated last year
- [NeurIPS 2022] HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions☆325Updated last year
- MetaFormer Baselines for Vision (TPAMI 2024)☆437Updated 7 months ago
- Implementation of the Swin Transformer in PyTorch.☆814Updated 3 years ago
- Code and models for mobile-former☆119Updated 2 years ago
- ☆634Updated 2 years ago
- Two simple and effective designs of vision transformer, which is on par with the Swin transformer☆593Updated last year
- [NeurIPS 2021 Spotlight] Official code for "Focal Self-attention for Local-Global Interactions in Vision Transformers"☆548Updated 2 years ago
- PyTorch reimplementation of the paper "Swin Transformer V2: Scaling Up Capacity and Resolution" [CVPR 2022].☆189Updated 2 years ago
- This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.☆227Updated 2 years ago
- EfficientNetV2 implementation using PyTorch☆125Updated 2 years ago
- This is an official implementation for "Contextual Transformer Networks for Visual Recognition".☆527Updated 3 years ago
- This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".☆281Updated 2 years ago
- Bottleneck Transformers for Visual Recognition☆275Updated 3 years ago
- [NeurIPS 2021] [T-PAMI] DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification☆586Updated last year
- RepMLPNet: Hierarchical Vision MLP with Re-parameterized Locality (CVPR 2022)☆306Updated last year
- [NeurIPS 2021] You Only Look at One Sequence☆849Updated 2 years ago