Official code for Conformer: Local Features Coupling Global Representations for Visual Recognition
☆600 · Oct 31, 2021 · Updated 4 years ago
Alternatives and similar repositories for Conformer
Users that are interested in Conformer are comparing it to the libraries listed below.
- [Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020) ☆1,127 · Jan 5, 2026 · Updated 5 months ago
- This is an official implementation for "Contextual Transformer Networks for Visual Recognition". ☆538 · Aug 8, 2021 · Updated 4 years ago
- Official implementation of PVT series ☆1,899 · Oct 27, 2022 · Updated 3 years ago
- [ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction" ☆290 · Apr 25, 2022 · Updated 4 years ago
- This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows". ☆15,969 · Jul 24, 2024 · Updated last year
- Two simple and effective designs of vision transformer, which is on par with the Swin transformer ☆610 · Feb 14, 2023 · Updated 3 years ago
- Codes for TS-CAM: Token Semantic Coupled Attention Map for Weakly Supervised Object Localization. ☆143 · Feb 16, 2023 · Updated 3 years ago
- This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition". ☆292 · Sep 28, 2022 · Updated 3 years ago
- ☆49 · Jan 23, 2022 · Updated 4 years ago
- Accelerating T2t-ViT by 1.6-3.6x. ☆260 · Nov 25, 2021 · Updated 4 years ago
- Simple implementation of Mobile-Former on Pytorch ☆108 · Sep 26, 2021 · Updated 4 years ago
- Official implementation of the paper "Unifying Nonlocal Blocks for Neural Networks" (ICCV'21) ☆99 · Mar 10, 2022 · Updated 4 years ago
- [ECCV 2022] Code for paper "DaViT: Dual Attention Vision Transformer" ☆376 · Feb 13, 2024 · Updated 2 years ago
- Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners ☆2,693 · Jul 25, 2023 · Updated 2 years ago
- Official DeiT repository ☆4,349 · Mar 15, 2024 · Updated 2 years ago
- ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet ☆1,192 · Oct 27, 2023 · Updated 2 years ago
- Official repository of ACmix (CVPR2022) ☆413 · Apr 25, 2022 · Updated 4 years ago
- CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped, CVPR 2022 ☆584 · Nov 1, 2023 · Updated 2 years ago
- 🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.… ☆12,178 · Mar 16, 2026 · Updated 3 months ago
- Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py… ☆25,335 · Jun 22, 2026 · Updated last week
- [ECCV 2022] Source code of "EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers" ☆358 · Dec 14, 2022 · Updated 3 years ago
- ☆216 · Dec 17, 2021 · Updated 4 years ago
- Bottleneck Transformers for Visual Recognition ☆279 · Mar 14, 2021 · Updated 5 years ago
- The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --… ☆36,900 · Jun 19, 2026 · Updated last week
- Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab. ☆4,417 · Mar 15, 2025 · Updated last year
- Pytorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers" ☆435 · Sep 5, 2023 · Updated 2 years ago
- (ICCV 2021 Oral) CoaT: Co-Scale Conv-Attentional Image Transformers ☆236 · Feb 3, 2022 · Updated 4 years ago
- VOLO: Vision Outlooker for Visual Recognition ☆948 · Sep 18, 2022 · Updated 3 years ago
- PyTorch implementation of Conformer: Convolution-augmented Transformer for Speech Recognition ☆18 · Apr 25, 2021 · Updated 5 years ago
- ☆110 · Sep 15, 2021 · Updated 4 years ago
- CMT: Convolutional Neural Networks Meet Vision Transformers ☆121 · Nov 11, 2021 · Updated 4 years ago
- [TIP 2022] CBNetV2: A Composite Backbone Network Architecture for Object Detection ☆393 · Oct 23, 2024 · Updated last year
- Code for the Convolutional Vision Transformer (ConViT) ☆474 · Oct 25, 2021 · Updated 4 years ago
- PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral) ☆1,362 · Jun 1, 2024 · Updated 2 years ago
- [CVPR 2021 & IJCV 2024] Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers ☆1,109 · Sep 2, 2024 · Updated last year
- This is an official implementation of CvT: Introducing Convolutions to Vision Transformers. ☆607 · May 16, 2023 · Updated 3 years ago
- (ImageNet pretrained models) The official PyTorch implementation of the TPAMI paper "Res2Net: A New Multi-scale Backbone Architecture" ☆1,115 · Dec 8, 2022 · Updated 3 years ago
- Collection of CVPR 2026 papers and open-source projects ☆22,723 · Mar 8, 2026 · Updated 3 months ago
- The official code for the paper: https://openreview.net/forum?id=_PHymLIxuI ☆403 · Jan 14, 2024 · Updated 2 years ago