rishikksh20/convolution-vision-transformers

PyTorch Implementation of CvT: Introducing Convolutions to Vision Transformers

☆226

Alternatives and similar repositories for convolution-vision-transformers

Users who are interested in convolution-vision-transformers are comparing it to the repositories listed below.

microsoft / CvT
This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.
☆609 · May 16, 2023 · Updated 3 years ago
rosinality / vision-transformers-pytorch
Implementation of various Vision Transformers I found interesting
☆84 · May 1, 2021 · Updated 5 years ago
zihangJiang / TokenLabeling
Pytorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers"
☆436 · Sep 5, 2023 · Updated 2 years ago
lucidrains / bottleneck-transformer-pytorch
Implementation of Bottleneck Transformer in Pytorch
☆678 · Sep 20, 2021 · Updated 4 years ago
wilile26811249 / CMT_CNN-meet-Vision-Transformer
A PyTorch implementation of CMT based on paper CMT: Convolutional Neural Networks Meet Vision Transformers.
☆72 · Mar 18, 2023 · Updated 3 years ago
facebookresearch / convit
Code for the Convolutional Vision Transformer (ConViT)
☆474 · Oct 25, 2021 · Updated 4 years ago
ofsoundof / LocalViT
☆118 · Jan 17, 2026 · Updated 6 months ago
rishikksh20 / CeiT-pytorch
Implementation of Convolutional enhanced image Transformer
☆106 · Mar 27, 2021 · Updated 5 years ago
microsoft / vision-longformer
☆249 · Mar 16, 2022 · Updated 4 years ago
rishikksh20 / CoaT-pytorch
CoaT: Co-Scale Conv-Attentional Image Transformers
☆15 · Apr 20, 2021 · Updated 5 years ago
yitu-opensource / T2T-ViT
ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
☆1,194 · Oct 27, 2023 · Updated 2 years ago
SHI-Labs / Compact-Transformers
Escaping the Big Data Paradigm with Compact Transformers, 2021 (Train your Vision Transformers in 30 mins on CIFAR-10 with a single GPU!)
☆546 · Nov 5, 2024 · Updated last year
naver-ai / pit
☆245 · Jul 23, 2021 · Updated 4 years ago
leoxiaobin / CvT
This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.
☆229 · Jul 4, 2022 · Updated 4 years ago
whai362 / PVT
Official implementation of PVT series
☆1,902 · Oct 27, 2022 · Updated 3 years ago
lucidrains / vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py…
☆25,423 · Jun 22, 2026 · Updated 3 weeks ago
sail-sg / volo
VOLO: Vision Outlooker for Visual Recognition
☆948 · Sep 18, 2022 · Updated 3 years ago
d-li14 / involution
[CVPR 2021] Involution: Inverting the Inherence of Convolution for Visual Recognition, a brand new neural operator
☆1,311 · Jul 16, 2021 · Updated 5 years ago
mlpc-ucsd / CoaT
(ICCV 2021 Oral) CoaT: Co-Scale Conv-Attentional Image Transformers
☆236 · Feb 3, 2022 · Updated 4 years ago
Meituan-AutoML / Twins
Two simple and effective designs of vision transformer, which is on par with the Swin transformer
☆611 · Feb 14, 2023 · Updated 3 years ago
facebookresearch / xcit
Official code Cross-Covariance Image Transformer (XCiT)
☆681 · Sep 28, 2021 · Updated 4 years ago
lucidrains / halonet-pytorch
Implementation of the 😇 Attention layer from the paper, Scaling Local Self-Attention For Parameter Efficient Visual Backbones
☆199 · Mar 24, 2021 · Updated 5 years ago
Yancccccc / HyFormer
HyFormer: Hybrid Transformer and CNN For Pixel-level Multispectral Image Classification
☆16 · Feb 15, 2023 · Updated 3 years ago
szq0214 / MEAL-V2
MEAL V2: Boosting Vanilla ResNet-50 to 80%+ Top-1 Accuracy on ImageNet without Tricks. In NeurIPS 2020 workshop.
☆700 · Dec 24, 2021 · Updated 4 years ago
dk-liang / Awesome-Visual-Transformer
Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)
☆3,587 · Jan 7, 2025 · Updated last year
YimianDai / open-atac
code and trained models for "Attention as Activation"
☆19 · Jul 16, 2020 · Updated 6 years ago
AlexeyAB / SPVT-Transformer
☆13 · Nov 7, 2021 · Updated 4 years ago
lucidrains / STAM-pytorch
Implementation of STAM (Space Time Attention Model), a pure and simple attention model that reaches SOTA for video classification
☆133 · Apr 1, 2021 · Updated 5 years ago
zhoudaquan / Refiner_ViT
☆110 · Sep 15, 2021 · Updated 4 years ago
ShoufaChen / CycleMLP
[ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"
☆290 · Apr 25, 2022 · Updated 4 years ago
DingXiaoH / RepMLP
RepMLPNet: Hierarchical Vision MLP with Re-parameterized Locality (CVPR 2022)
☆307 · Feb 10, 2023 · Updated 3 years ago
microsoft / Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
☆16,003 · Jul 24, 2024 · Updated last year
mlpc-ucsd / BERT_Convolutions
(ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.
☆21 · Jul 13, 2022 · Updated 4 years ago
Sara-Ahmed / SiT
Self-supervised vIsion Transformer (SiT)
☆335 · Dec 24, 2022 · Updated 3 years ago
berniwal / swin-transformer-pytorch
Implementation of the Swin Transformer in PyTorch.
☆862 · Mar 29, 2021 · Updated 5 years ago
theodoruszq / PTSNet
Proposal, Tracking and Segmentation (PTS): A Cascaded Network for Video Object Segmentation
☆117 · Nov 23, 2019 · Updated 6 years ago
facebookresearch / deit
Official DeiT repository
☆4,358 · Mar 15, 2024 · Updated 2 years ago
kevin-ssy / ViP
Official implementation of the paper Visual Parser: Representing Part-whole Hierarchies with Transformers
☆120 · Aug 12, 2021 · Updated 4 years ago
danczs / Visformer
☆135 · Feb 10, 2023 · Updated 3 years ago