A paper list of some recent Transformer-based CV works.
☆1,431Nov 19, 2025Updated 3 months ago
Alternatives and similar repositories for Transformer-in-Computer-Vision
Users that are interested in Transformer-in-Computer-Vision are comparing it to the libraries listed below
Sorting:
- Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)☆3,565Jan 7, 2025Updated last year
- Recent Transformer-based CV and related works.☆1,340Aug 22, 2023Updated 2 years ago
- An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites☆5,016Jul 30, 2024Updated last year
- [T-PAMI-2024] Transformer-Based Visual Segmentation: A Survey☆759Aug 25, 2024Updated last year
- Collect some papers about transformer for detection and segmentation. Awesome Detection Transformer for Computer Vision (CV)☆1,397Jul 4, 2024Updated last year
- Temporal Action Detection & Weakly Supervised Temporal Action Detection & Temporal Action Proposal Generation☆571Jan 30, 2026Updated last month
- The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…☆36,420Feb 26, 2026Updated last week
- A collection of papers about Referring Image Segmentation.☆809Jan 28, 2026Updated last month
- This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".☆15,721Jul 24, 2024Updated last year
- CVPR 2026 论文和开源项目合集☆21,890Feb 25, 2026Updated last week
- A curated list of awesome vision and language resources (still under construction... stay tuned!)☆560Nov 4, 2024Updated last year
- This repository is for the first comprehensive survey on Meta AI's Segment Anything Model (SAM).☆1,214Updated this week
- Official DeiT repository☆4,325Mar 15, 2024Updated last year
- Code release for ConvNeXt model☆6,300Jan 8, 2023Updated 3 years ago
- A Survey on Transformer in CV.☆192Jun 18, 2023Updated 2 years ago
- ☆12,332Updated this week
- Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, I…☆12,643Apr 7, 2025Updated 10 months ago
- Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformerwith Deformable Atte…☆926Apr 17, 2024Updated last year
- Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).☆1,232Jun 28, 2024Updated last year
- Summary of related papers on visual attention. Related code will be released based on Jittor gradually.☆2,846Oct 20, 2024Updated last year
- Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022☆1,175May 15, 2024Updated last year
- This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".☆1,026Sep 29, 2022Updated 3 years ago
- PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)☆1,367Jun 1, 2024Updated last year
- Reading list for research topics in Masked Image Modeling☆338Dec 3, 2024Updated last year
- 🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.…☆12,162Dec 6, 2024Updated last year
- A collection of literature after or concurrent with Masked Autoencoder (MAE) (Kaiming He el al.).☆860Jul 10, 2024Updated last year
- [CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize …☆1,976Jan 24, 2024Updated 2 years ago
- A curated list of prompt-based paper in computer vision and vision-language learning.☆925Dec 18, 2023Updated 2 years ago
- A paper list of some recent Mamba-based CV works.☆444Nov 10, 2025Updated 3 months ago
- This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.☆601May 16, 2023Updated 2 years ago
- End-to-End Object Detection with Transformers☆15,124Mar 12, 2024Updated last year
- detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.☆2,276Sep 11, 2025Updated 5 months ago
- A collection of resources and papers on Diffusion Models☆12,273Aug 1, 2024Updated last year
- [ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions☆1,475Jun 3, 2025Updated 9 months ago
- Deformable DETR: Deformable Transformers for End-to-End Object Detection.☆3,898May 16, 2024Updated last year
- Official implementation of PVT series☆1,887Oct 27, 2022Updated 3 years ago
- A curated list of awesome self-supervised methods☆6,363Feb 24, 2026Updated last week
- Recent Advances in Vision and Language PreTrained Models (VL-PTMs)☆1,155Aug 19, 2022Updated 3 years ago
- PyTorch implementation of MAE https//arxiv.org/abs/2111.06377☆8,230Jul 23, 2024Updated last year