whai362/PVT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/whai362/PVT)

whai362 / PVT

Official implementation of PVT series

☆1,902

Alternatives and similar repositories for PVT

Users that are interested in PVT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Meituan-AutoML / Twins
View on GitHub
Two simple and effective designs of vision transformer, which is on par with the Swin transformer
☆611Feb 14, 2023Updated 3 years ago
yitu-opensource / T2T-ViT
View on GitHub
ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
☆1,194Oct 27, 2023Updated 2 years ago
microsoft / Swin-Transformer
View on GitHub
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
☆16,010Jul 24, 2024Updated 2 years ago
facebookresearch / deit
View on GitHub
Official DeiT repository
☆4,359Mar 15, 2024Updated 2 years ago
facebookresearch / MaskFormer
View on GitHub
Per-Pixel Classification is Not All You Need for Semantic Segmentation (NeurIPS 2021, spotlight)
☆1,463Mar 11, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
PeizeSun / SparseR-CNN
View on GitHub
[CVPR2021, PAMI2023] End-to-End Object Detection with Learnable Proposal
☆1,345Apr 30, 2023Updated 3 years ago
fundamentalvision / Deformable-DETR
View on GitHub
Deformable DETR: Deformable Transformers for End-to-End Object Detection.
☆4,004May 16, 2024Updated 2 years ago
whai362 / PVTv2-Seg
View on GitHub
☆64Jan 22, 2022Updated 4 years ago
Meituan-AutoML / CPVT
View on GitHub
☆196Feb 14, 2023Updated 3 years ago
fudan-zvg / SETR
View on GitHub
[CVPR 2021 & IJCV 2024] Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers
☆1,108Sep 2, 2024Updated last year
ShoufaChen / CycleMLP
View on GitHub
[ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"
☆290Apr 25, 2022Updated 4 years ago
facebookresearch / detr
View on GitHub
End-to-End Object Detection with Transformers
☆15,355Mar 12, 2024Updated 2 years ago
facebookresearch / ConvNeXt
View on GitHub
Code release for ConvNeXt model
☆6,415Jan 8, 2023Updated 3 years ago
lucidrains / vit-pytorch
View on GitHub
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py…
☆25,436Jun 22, 2026Updated last month
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
d-li14 / involution
View on GitHub
[CVPR 2021] Involution: Inverting the Inherence of Convolution for Visual Recognition, a brand new neural operator
☆1,311Jul 16, 2021Updated 5 years ago
microsoft / CSWin-Transformer
View on GitHub
CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped, CVPR 2022
☆586Nov 1, 2023Updated 2 years ago
NVlabs / SegFormer
View on GitHub
Official PyTorch implementation of SegFormer
☆3,599Aug 2, 2024Updated last year
dk-liang / Awesome-Visual-Transformer
View on GitHub
Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)
☆3,589Jan 7, 2025Updated last year
huggingface / pytorch-image-models
View on GitHub
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…
☆37,012Jul 16, 2026Updated last week
sail-sg / volo
View on GitHub
VOLO: Vision Outlooker for Visual Recognition
☆948Sep 18, 2022Updated 3 years ago
aim-uofa / AdelaiDet
View on GitHub
AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
☆3,483Aug 23, 2024Updated last year
zhanghang1989 / ResNeSt
View on GitHub
ResNeSt: Split-Attention Networks
☆3,262Dec 9, 2022Updated 3 years ago
naver-ai / pit
View on GitHub
☆245Jul 23, 2021Updated 5 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
sail-sg / poolformer
View on GitHub
PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)
☆1,363Jun 1, 2024Updated 2 years ago
Megvii-BaseDetection / DeFCN
View on GitHub
End-to-End Object Detection with Fully Convolutional Network
☆494Jan 10, 2022Updated 4 years ago
wofmanaf / ResT
View on GitHub
This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".
☆291Sep 28, 2022Updated 3 years ago
PeizeSun / OneNet
View on GitHub
[ICML2021] What Makes for End-to-End Object Detection
☆641Apr 30, 2023Updated 3 years ago
DengPingFan / Polyp-PVT
View on GitHub
Polyp-PVT: Polyp Segmentation with Pyramid Vision Transformers, AIR 2023.
☆262Nov 1, 2023Updated 2 years ago
facebookresearch / LeViT
View on GitHub
LeViT a Vision Transformer in ConvNet's Clothing for Faster Inference
☆624Aug 27, 2022Updated 3 years ago
DingXiaoH / RepVGG
View on GitHub
RepVGG: Making VGG-style ConvNets Great Again
☆3,479Feb 10, 2023Updated 3 years ago
google-research / vision_transformer
View on GitHub
☆12,643Jul 9, 2026Updated 2 weeks ago
Megvii-BaseDetection / BorderDet
View on GitHub
BorderDet: Border Feature for Dense Object Detection(ECCV2020 Oral)
☆429Mar 25, 2021Updated 5 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
microsoft / vision-longformer
View on GitHub
☆249Mar 16, 2022Updated 4 years ago
xieenze / Trans2Seg
View on GitHub
☆157Oct 15, 2022Updated 3 years ago
facebookresearch / mae
View on GitHub
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
☆8,369Jul 23, 2024Updated 2 years ago
xieenze / DetCo
View on GitHub
☆278Feb 23, 2021Updated 5 years ago
YuqingWang1029 / VisTR
View on GitHub
[CVPR2021 Oral] End-to-End Video Instance Segmentation with Transformers
☆757Jul 15, 2021Updated 5 years ago
microsoft / SimMIM
View on GitHub
This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".
☆1,047Sep 29, 2022Updated 3 years ago
xingyizhou / CenterNet2
View on GitHub
Two-stage CenterNet
☆1,222Nov 20, 2022Updated 3 years ago