pengzhiliang/Conformer

pengzhiliang / Conformer

Official code for Conformer: Local Features Coupling Global Representations for Visual Recognition

☆600

Alternatives and similar repositories for Conformer

Users who are interested in Conformer are comparing it to the libraries listed below.

sooftware / conformer
[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
☆1,130 · Jun 29, 2026 · Updated 3 weeks ago
JDAI-CV / CoTNet
This is an official implementation for "Contextual Transformer Networks for Visual Recognition".
☆538 · Aug 8, 2021 · Updated 4 years ago
whai362 / PVT
Official implementation of PVT series
☆1,902 · Oct 27, 2022 · Updated 3 years ago
ShoufaChen / CycleMLP
[ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"
☆290 · Apr 25, 2022 · Updated 4 years ago
microsoft / Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
☆16,003 · Jul 24, 2024 · Updated last year
Meituan-AutoML / Twins
Two simple and effective designs of vision transformer, which is on par with the Swin transformer
☆611 · Feb 14, 2023 · Updated 3 years ago
vasgaowei / TS-CAM
Codes for TS-CAM: Token Semantic Coupled Attention Map for Weakly Supervised Object Localization.
☆143 · Feb 16, 2023 · Updated 3 years ago
wofmanaf / ResT
This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".
☆291 · Sep 28, 2022 · Updated 3 years ago
krushi1992 / MOA-transformer
☆49 · Jan 23, 2022 · Updated 4 years ago
blackfeather-wang / Dynamic-Vision-Transformer
Accelerating T2t-ViT by 1.6-3.6x.
☆260 · Nov 25, 2021 · Updated 4 years ago
ACheun9 / Pytorch-implementation-of-Mobile-Former
Simple implementation of Mobile-Former on Pytorch
☆108 · Sep 26, 2021 · Updated 4 years ago
zh460045050 / SNL_ICCV2021
Official implementation of the paper "Unifying Nonlocal Blocks for Neural Networks" (ICCV'21)
☆99 · Mar 10, 2022 · Updated 4 years ago
dingmyu / davit
[ECCV 2022] Code for paper "DaViT: Dual Attention Vision Transformer"
☆378 · Feb 13, 2024 · Updated 2 years ago
pengzhiliang / MAE-pytorch
Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners
☆2,691 · Jul 25, 2023 · Updated 2 years ago
facebookresearch / deit
Official DeiT repository
☆4,358 · Mar 15, 2024 · Updated 2 years ago
yitu-opensource / T2T-ViT
ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
☆1,194 · Oct 27, 2023 · Updated 2 years ago
LeapLabTHU / ACmix
Official repository of ACmix (CVPR2022)
☆412 · Apr 25, 2022 · Updated 4 years ago
microsoft / CSWin-Transformer
CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped, CVPR 2022
☆585 · Nov 1, 2023 · Updated 2 years ago
xmu-xiaoma666 / External-Attention-pytorch
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.…
☆12,182 · Mar 16, 2026 · Updated 4 months ago
lucidrains / vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py…
☆25,423 · Jun 22, 2026 · Updated last month
hkzhang-git / ParC-Net
[ECCV 2022] Source code of "EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers"
☆359 · Dec 14, 2022 · Updated 3 years ago
leaderj1001 / BottleneckTransformers
Bottleneck Transformers for Visual Recognition
☆279 · Mar 14, 2021 · Updated 5 years ago
OliverRensu / Shunted-Transformer
☆216 · Dec 17, 2021 · Updated 4 years ago
huggingface / pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…
☆36,993 · Updated this week
huawei-noah / Efficient-AI-Backbones
Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.
☆4,416 · Mar 15, 2025 · Updated last year
zihangJiang / TokenLabeling
Pytorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers"
☆436 · Sep 5, 2023 · Updated 2 years ago
mlpc-ucsd / CoaT
(ICCV 2021 Oral) CoaT: Co-Scale Conv-Attentional Image Transformers
☆236 · Feb 3, 2022 · Updated 4 years ago
sail-sg / volo
VOLO: Vision Outlooker for Visual Recognition
☆948 · Sep 18, 2022 · Updated 3 years ago
jaketae / conformer
PyTorch implementation of Conformer: Convolution-augmented Transformer for Speech Recognition
☆18 · Apr 25, 2021 · Updated 5 years ago
zhoudaquan / Refiner_ViT
☆110 · Sep 15, 2021 · Updated 4 years ago
FlyEgle / CMT-pytorch
CMT: Convolutional Neural Networks Meet Vision Transformers
☆121 · Nov 11, 2021 · Updated 4 years ago
VDIGPKU / CBNetV2
[TIP 2022] CBNetV2: A Composite Backbone Network Architecture for Object Detection
☆394 · Oct 23, 2024 · Updated last year
facebookresearch / convit
Code for the Convolutional Vision Transformer (ConViT)
☆474 · Oct 25, 2021 · Updated 4 years ago
sail-sg / poolformer
PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)
☆1,363 · Jun 1, 2024 · Updated 2 years ago
fudan-zvg / SETR
[CVPR 2021 & IJCV 2024] Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers
☆1,108 · Sep 2, 2024 · Updated last year
Res2Net / Res2Net-PretrainedModels
(ImageNet pretrained models) The official PyTorch implementation of the TPAMI paper "Res2Net: A New Multi-scale Backbone Architecture"
☆1,114 · Dec 8, 2022 · Updated 3 years ago
microsoft / CvT
This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.
☆609 · May 16, 2023 · Updated 3 years ago
amusi / CVPR2026-Papers-with-Code
A collection of CVPR 2026 papers and open-source projects
☆22,760 · Mar 8, 2026 · Updated 4 months ago
cheerss / CrossFormer
The official code for the paper: https://openreview.net/forum?id=_PHymLIxuI
☆403 · Jan 14, 2024 · Updated 2 years ago