tahmid0007/VisualTransformers

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tahmid0007/VisualTransformers)

tahmid0007 / VisualTransformers

A Pytorch Implementation of the following paper "Visual Transformers: Token-based Image Representation and Processing for Computer Vision"

☆182

Alternatives and similar repositories for VisualTransformers

Users that are interested in VisualTransformers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

tahmid0007 / VisionTransformer
View on GitHub
A complete easy to follow implementation of Google's Vision Transformer proposed in "AN IMAGE IS WORTH 16X16 WORDS". This pytorch impleme…
☆101Dec 1, 2020Updated 5 years ago
NazirNayal8 / visual-transformer
View on GitHub
An implementation of the Visual Transformer Architecture introduced in the paper "Visual Transformers: Token-based Image Representation a…
☆17May 27, 2021Updated 5 years ago
aws-samples / amazon-sagemaker-visual-transformer
View on GitHub
Implementation of Image Classification using Visual Transformers in Amazon SageMaker based on the ideas from research paper - Visual Tran…
☆18Dec 28, 2020Updated 5 years ago
naver-ai / pit
View on GitHub
☆245Jul 23, 2021Updated 5 years ago
blanclist / ICNet
View on GitHub
ICNet: Intra-saliency Correlation Network for Co-Saliency Detection, NeurIPS(2020)
☆30Apr 18, 2021Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
dk-liang / Awesome-Visual-Transformer
View on GitHub
Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)
☆3,589Jan 7, 2025Updated last year
yitu-opensource / T2T-ViT
View on GitHub
ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
☆1,194Oct 27, 2023Updated 2 years ago
gbup-group / EAN-efficient-attention-network
View on GitHub
The implementation of paper ''Efficient Attention Network: Accelerate Attention by Searching Where to Plug''.
☆20Jun 16, 2023Updated 3 years ago
CASIA-LMC-Lab / DPT
View on GitHub
DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM2021)
☆158Aug 18, 2021Updated 4 years ago
DTennant / CL-Visualizing-Feature-Transformation
View on GitHub
Improving Contrastive Learning by Visualizing Feature Transformation, ICCV 2021 Oral
☆90Oct 11, 2021Updated 4 years ago
whai362 / PVT
View on GitHub
Official implementation of PVT series
☆1,902Oct 27, 2022Updated 3 years ago
alohays / awesome-visual-representation-learning-with-transformers
View on GitHub
Awesome Transformers (self-attention) in Computer Vision
☆271Jul 31, 2021Updated 4 years ago
lucidrains / transformer-in-transformer
View on GitHub
Implementation of Transformer in Transformer, pixel level attention paired with patch level attention for image classification, in Pytorc…
☆306Dec 27, 2021Updated 4 years ago
facebookresearch / deit
View on GitHub
Official DeiT repository
☆4,357Mar 15, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
fudan-zvg / SETR
View on GitHub
[CVPR 2021 & IJCV 2024] Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers
☆1,108Sep 2, 2024Updated last year
qdu1995 / DQSD
View on GitHub
☆11Jun 27, 2021Updated 5 years ago
lucidrains / vit-pytorch
View on GitHub
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py…
☆25,447Jun 22, 2026Updated last month
csrhddlam / axial-deeplab
View on GitHub
This is a PyTorch re-implementation of Axial-DeepLab (ECCV 2020 Spotlight)
☆459Jun 22, 2021Updated 5 years ago
guanfuchen / cvpr_review
View on GitHub
整理cvpr论文，包括摘要，动机，架构，结果，总结
☆27Dec 15, 2018Updated 7 years ago
VITA-Group / TransGAN
View on GitHub
[NeurIPS‘2021] "TransGAN: Two Pure Transformers Can Make One Strong GAN, and That Can Scale Up", Yifan Jiang, Shiyu Chang, Zhangyang Wang
☆1,693Nov 3, 2022Updated 3 years ago
kenomo / industrial-clip
View on GitHub
Code for training and evaluation on the "Industrial Language-Image Dataset (ILID)".
☆10Jun 4, 2025Updated last year
jiangtaoxie / SoT
View on GitHub
SoT: Delving Deeper into Classification Head for Transformer
☆50Dec 24, 2021Updated 4 years ago
zipengxuc / PPE
View on GitHub
Code for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-L…
☆37Apr 13, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
delair-ai / DISIR
View on GitHub
Deep Image Segmentation with Interactive Refinement
☆35Feb 8, 2023Updated 3 years ago
yuexy / PS-ViT
View on GitHub
Official implementation of the paper Vision Transformer with Progressive Sampling, ICCV 2021.
☆153Jan 14, 2022Updated 4 years ago
roiponytch / Flickering_Adversarial_Video
View on GitHub
Code and videos accompanying the paper "Flickering Adversarial Attacks against Video Recognition Networks"
☆16Dec 8, 2022Updated 3 years ago
Muzammal-Naseer / IPViT
View on GitHub
Official repository for "Intriguing Properties of Vision Transformers" (NeurIPS 2021--Spotlight)
☆183Aug 9, 2022Updated 3 years ago
Meituan-AutoML / CPVT
View on GitHub
☆196Feb 14, 2023Updated 3 years ago
mrochan / adaptive-highlight
View on GitHub
☆21Nov 29, 2020Updated 5 years ago
kevin-ssy / ViP
View on GitHub
Official implementation of the paper Visual Parser: Representing Part-whole Hierarchies with Transformers
☆120Aug 12, 2021Updated 4 years ago
mengqiDyangge / HierKD
View on GitHub
☆39Aug 25, 2022Updated 3 years ago
jeonsworld / ViT-pytorch
View on GitHub
Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
☆2,159Jun 7, 2022Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
rishikksh20 / CrossViT-pytorch
View on GitHub
Implementation of CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
☆208Apr 7, 2021Updated 5 years ago
bermanmaxim / superpixPool
View on GitHub
Superpixel Pooling implemented in PyTorch and Chainer
☆136Aug 8, 2019Updated 6 years ago
prismformore / expAT
View on GitHub
TIP: Bi-directional Exponential Angular Triplet Loss for RGB-Infrared Person Re-Identification
☆21Mar 29, 2021Updated 5 years ago
sigopt / stanford-car-classification
View on GitHub
Classifying the Stanford Car dataset using ResNet 50
☆25Aug 17, 2023Updated 2 years ago
The-AI-Summer / self-attention-cv
View on GitHub
Implementation of various self-attention mechanisms focused on computer vision. Ongoing repository.
☆1,215Sep 14, 2021Updated 4 years ago
friendshipkim / neuron-merging
View on GitHub
Neuron Merging: Compensating for Pruned Neurons (NeurIPS 2020)
☆43Feb 4, 2021Updated 5 years ago
srebuffi / revisiting_saliency
View on GitHub
There and Back Again: Revisiting Backpropagation Saliency Methods (CVPR 2020)
☆53Apr 7, 2020Updated 6 years ago