facebookresearch/convit

facebookresearch / convit

Code for the Convolutional Vision Transformer (ConViT)

☆474

Alternatives and similar repositories for convit

Users who are interested in convit are comparing it to the libraries listed below.

facebookresearch / deit
Official DeiT repository
☆4,359 · Mar 15, 2024 · Updated 2 years ago

facebookresearch / xcit
Official code Cross-Covariance Image Transformer (XCiT)
☆681 · Sep 28, 2021 · Updated 4 years ago

microsoft / CvT
This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.
☆609 · May 16, 2023 · Updated 3 years ago

facebookresearch / LeViT
LeViT a Vision Transformer in ConvNet's Clothing for Faster Inference
☆624 · Aug 27, 2022 · Updated 3 years ago

sail-sg / volo
VOLO: Vision Outlooker for Visual Recognition
☆948 · Sep 18, 2022 · Updated 3 years ago

zihangJiang / TokenLabeling
Pytorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers"
☆436 · Sep 5, 2023 · Updated 2 years ago

yitu-opensource / T2T-ViT
ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
☆1,194 · Oct 27, 2023 · Updated 2 years ago

mlpc-ucsd / CoaT
(ICCV 2021 Oral) CoaT: Co-Scale Conv-Attentional Image Transformers
☆236 · Feb 3, 2022 · Updated 4 years ago

whai362 / PVT
Official implementation of PVT series
☆1,902 · Oct 27, 2022 · Updated 3 years ago

naver-ai / pit
☆245 · Jul 23, 2021 · Updated 4 years ago

wofmanaf / ResT
This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".
☆291 · Sep 28, 2022 · Updated 3 years ago

microsoft / vision-longformer
☆249 · Mar 16, 2022 · Updated 4 years ago

Meituan-AutoML / Twins
Two simple and effective designs of vision transformer, which is on par with the Swin transformer
☆611 · Feb 14, 2023 · Updated 3 years ago

houqb / VisionPermutator
MLP-Like Vision Permutator for Visual Recognition (PyTorch)
☆192 · Mar 31, 2022 · Updated 4 years ago

danczs / Visformer
☆135 · Feb 10, 2023 · Updated 3 years ago

d-li14 / involution
[CVPR 2021] Involution: Inverting the Inherence of Convolution for Visual Recognition, a brand new neural operator
☆1,311 · Jul 16, 2021 · Updated 5 years ago

rishikksh20 / convolution-vision-transformers
PyTorch Implementation of CvT: Introducing Convolutions to Vision Transformers
☆226 · May 26, 2021 · Updated 5 years ago

ShoufaChen / CycleMLP
[ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"
☆290 · Apr 25, 2022 · Updated 4 years ago

facebookresearch / ConvNeXt
Code release for ConvNeXt model
☆6,414 · Jan 8, 2023 · Updated 3 years ago

locuslab / convmixer
Implementation of ConvMixer for "Patches Are All You Need? 🤷"
☆1,081 · Nov 11, 2022 · Updated 3 years ago

blackfeather-wang / Dynamic-Vision-Transformer
Accelerating T2t-ViT by 1.6-3.6x.
☆260 · Nov 25, 2021 · Updated 4 years ago

lucidrains / vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py…
☆25,428 · Jun 22, 2026 · Updated last month

facebookresearch / vissl
VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.
☆3,295 · Mar 3, 2024 · Updated 2 years ago

Sense-X / UniFormer
[ICLR2022] official implementation of UniFormer
☆906 · Mar 29, 2024 · Updated 2 years ago

microsoft / Focal-Transformer
[NeurIPS 2021 Spotlight] Official code for "Focal Self-attention for Local-Global Interactions in Vision Transformers"
☆559 · Mar 27, 2022 · Updated 4 years ago

lukemelas / do-you-even-need-attention
Is the attention layer even necessary? (https://arxiv.org/abs/2105.02723)
☆485 · May 7, 2021 · Updated 5 years ago

rishikksh20 / CeiT-pytorch
Implementation of Convolutional enhanced image Transformer
☆106 · Mar 27, 2021 · Updated 5 years ago

raoyongming / DynamicViT
[NeurIPS 2021] [T-PAMI] DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification
☆668 · Jul 11, 2023 · Updated 3 years ago

google-research / vision_transformer
☆12,635 · Jul 9, 2026 · Updated last week

lucidrains / halonet-pytorch
Implementation of the 😇 Attention layer from the paper, Scaling Local Self-Attention For Parameter Efficient Visual Backbones
☆199 · Mar 24, 2021 · Updated 5 years ago

facebookresearch / dino
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
☆7,608 · Jul 3, 2024 · Updated 2 years ago

facebookresearch / suncet
Code to reproduce the results in the FAIR research papers "Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting V…
☆494 · Apr 28, 2023 · Updated 3 years ago

microsoft / Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
☆16,004 · Jul 24, 2024 · Updated last year

Meituan-AutoML / CPVT
☆196 · Feb 14, 2023 · Updated 3 years ago

zhoudaquan / Refiner_ViT
☆110 · Sep 15, 2021 · Updated 4 years ago

sail-sg / poolformer
PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)
☆1,363 · Jun 1, 2024 · Updated 2 years ago

VITA-Group / AsViT
[ICLR 2022] "As-ViT: Auto-scaling Vision Transformers without Training" by Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wa…
☆76 · Feb 21, 2022 · Updated 4 years ago

huggingface / pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…
☆37,000 · Updated this week

mulinmeng / Shuffle-Transformer
☆98 · Apr 27, 2022 · Updated 4 years ago