A paper list of some recent Transformer-based CV works.
☆1,436Nov 19, 2025Updated 4 months ago
Alternatives and similar repositories for Transformer-in-Computer-Vision
Users that are interested in Transformer-in-Computer-Vision are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)☆3,574Jan 7, 2025Updated last year
- Recent Transformer-based CV and related works.☆1,339Aug 22, 2023Updated 2 years ago
- An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites☆5,024Jul 30, 2024Updated last year
- [T-PAMI-2024] Transformer-Based Visual Segmentation: A Survey☆756Aug 25, 2024Updated last year
- Temporal Action Detection & Weakly Supervised Temporal Action Detection & Temporal Action Proposal Generation☆576Updated this week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Collect some papers about transformer for detection and segmentation. Awesome Detection Transformer for Computer Vision (CV)☆1,395Jul 4, 2024Updated last year
- The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…☆36,538Mar 18, 2026Updated last week
- A collection of papers about Referring Image Segmentation.☆812Jan 28, 2026Updated last month
- This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".☆15,782Jul 24, 2024Updated last year
- A curated list of awesome vision and language resources (still under construction... stay tuned!)☆559Nov 4, 2024Updated last year
- CVPR 2026 论文和开源项目合集☆22,239Mar 8, 2026Updated 2 weeks ago
- A paper list of some recent Mamba-based CV works.☆457Mar 3, 2026Updated 3 weeks ago
- ☆12,381Mar 3, 2026Updated 3 weeks ago
- Official DeiT repository☆4,327Mar 15, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Code release for ConvNeXt model☆6,326Jan 8, 2023Updated 3 years ago
- A Survey on Transformer in CV.☆192Jun 18, 2023Updated 2 years ago
- Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, I…☆12,701Apr 7, 2025Updated 11 months ago
- This repository is for the first comprehensive survey on Meta AI's Segment Anything Model (SAM).☆1,215Updated this week
- Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformerwith Deformable Atte…☆926Apr 17, 2024Updated last year
- A collection of literature after or concurrent with Masked Autoencoder (MAE) (Kaiming He el al.).☆861Jul 10, 2024Updated last year
- Summary of related papers on visual attention. Related code will be released based on Jittor gradually.☆2,844Oct 20, 2024Updated last year
- PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)☆1,366Jun 1, 2024Updated last year
- 🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.…☆12,170Mar 16, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Recent Advances in Vision and Language PreTrained Models (VL-PTMs)☆1,155Aug 19, 2022Updated 3 years ago
- A curated list of prompt-based paper in computer vision and vision-language learning.☆925Dec 18, 2023Updated 2 years ago
- Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022☆1,176May 15, 2024Updated last year
- [CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize …☆1,984Jan 24, 2024Updated 2 years ago
- Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).☆1,230Jun 28, 2024Updated last year
- This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".☆1,030Sep 29, 2022Updated 3 years ago
- End-to-End Object Detection with Transformers☆15,181Mar 12, 2024Updated 2 years ago
- [NeurIPS 2021] [T-PAMI] DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification☆652Jul 11, 2023Updated 2 years ago
- Vision Transformers with Hierarchical Attention☆103Sep 11, 2025Updated 6 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A collection of resources and papers on Diffusion Models☆12,297Aug 1, 2024Updated last year
- Implementation of vision transformer. ⭐⭐⭐☆33Oct 26, 2021Updated 4 years ago
- detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.☆2,274Sep 11, 2025Updated 6 months ago
- This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.☆603May 16, 2023Updated 2 years ago
- A collection of resources on applications of Transformers in Medical Imaging.☆1,286Apr 18, 2024Updated last year
- Official implementation of PVT series☆1,889Oct 27, 2022Updated 3 years ago
- Reading list for research topics in Masked Image Modeling☆335Dec 3, 2024Updated last year