cmhungsteve / Awesome-Transformer-AttentionLinks

An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites

☆4,912

Alternatives and similar repositories for Awesome-Transformer-Attention

Users that are interested in Awesome-Transformer-Attention are comparing it to the libraries listed below

Sorting:

dk-liang / Awesome-Visual-Transformer
Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)
☆3,510Updated 6 months ago
Yangzhangcst / Transformer-in-Computer-Vision
A paper list of some recent Transformer-based CV works.
☆1,337Updated this week
facebookresearch / ConvNeXt
Code release for ConvNeXt model
☆6,076Updated 2 years ago
DirtyHarryLYL / Transformer-in-Vision
Recent Transformer-based CV and related works.
☆1,334Updated last year
facebookresearch / deit
Official DeiT repository
☆4,239Updated last year
jeonsworld / ViT-pytorch
Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
☆2,064Updated 3 years ago
facebookresearch / dino
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
☆7,024Updated last year
microsoft / Cream
This is a collection of our NAS and Vision Transformer work.
☆1,785Updated last year
open-mmlab / mmselfsup
OpenMMLab Self-Supervised Learning Toolbox and Benchmark
☆3,273Updated 2 years ago
facebookresearch / moco
PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722
☆5,026Updated last month
apple / ml-cvnets
CVNets: A library for training computer vision networks
☆1,905Updated last year
hila-chefer / Transformer-Explainability
[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize …
☆1,913Updated last year
google-research / vision_transformer
☆11,632Updated 4 months ago
YangLing0818 / Diffusion-Models-Papers-Survey-Taxonomy
Diffusion model papers, survey, and taxonomy
☆3,224Updated last month
google-research / scenic
Scenic: A Jax Library for Computer Vision Research and Beyond
☆3,618Updated 3 weeks ago
IDEA-Research / awesome-detection-transformer
Collect some papers about transformer for detection and segmentation. Awesome Detection Transformer for Computer Vision (CV)
☆1,363Updated last year
pengzhiliang / MAE-pytorch
Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners
☆2,661Updated 2 years ago
jason718 / awesome-self-supervised-learning
A curated list of awesome self-supervised methods
☆6,312Updated last year
jacobgil / pytorch-grad-cam
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, I…
☆11,972Updated 3 months ago
facebookresearch / ConvNeXt-V2
Code release for ConvNeXt V2 model
☆1,801Updated 11 months ago
facebookresearch / fvcore
Collection of common code that's shared among different research projects in FAIR computer vision team.
☆2,157Updated 2 weeks ago
microsoft / Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
☆15,057Updated last year
MenghaoGuo / Awesome-Vision-Attentions
Summary of related papers on visual attention. Related code will be released based on Jittor gradually.
☆2,820Updated 9 months ago
open-mmlab / mmpretrain
OpenMMLab Pre-training Toolbox and Benchmark
☆3,721Updated 9 months ago
jacobgil / vit-explain
Explainability for Vision Transformers
☆986Updated 3 years ago
EdisonLeeeee / Awesome-Masked-Autoencoders
A collection of literature after or concurrent with Masked Autoencoder (MAE) (Kaiming He el al.).
☆841Updated last year
frgfm / torch-cam
Class activation maps for your PyTorch models (CAM, Grad-CAM, Grad-CAM++, Smooth Grad-CAM++, Score-CAM, SS-CAM, IS-CAM, XGrad-CAM, Layer-…
☆2,232Updated last week
sail-sg / poolformer
PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)
☆1,347Updated last year
NVlabs / SegFormer
Official PyTorch implementation of SegFormer
☆3,005Updated 11 months ago
google-research / big_vision
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
☆3,044Updated 2 months ago