The-AI-Summer/self-attention-cv

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/The-AI-Summer/self-attention-cv)

The-AI-Summer / self-attention-cv

Implementation of various self-attention mechanisms focused on computer vision. Ongoing repository.

☆1,215

Alternatives and similar repositories for self-attention-cv

Users that are interested in self-attention-cv are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lucidrains / vit-pytorch
View on GitHub
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py…
☆25,428Jun 22, 2026Updated last month
lucidrains / bottleneck-transformer-pytorch
View on GitHub
Implementation of Bottleneck Transformer in Pytorch
☆678Sep 20, 2021Updated 4 years ago
04RR / SOTA-Vision
View on GitHub
Implementation of various state of the art architectures used in computer vision.
☆32Aug 29, 2021Updated 4 years ago
huggingface / pytorch-image-models
View on GitHub
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…
☆37,000Updated this week
dk-liang / Awesome-Visual-Transformer
View on GitHub
Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)
☆3,588Jan 7, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Beckschen / TransUNet
View on GitHub
This repository includes the official project of TransUNet, presented in our paper: TransUNet: Transformers Make Strong Encoders for Medi…
☆3,219Feb 25, 2026Updated 4 months ago
leaderj1001 / BottleneckTransformers
View on GitHub
Bottleneck Transformers for Visual Recognition
☆279Mar 14, 2021Updated 5 years ago
lucidrains / TimeSformer-pytorch
View on GitHub
Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification
☆730Aug 25, 2021Updated 4 years ago
microsoft / Swin-Transformer
View on GitHub
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
☆16,004Jul 24, 2024Updated last year
jeya-maria-jose / Medical-Transformer
View on GitHub
Official Pytorch Code for "Medical Transformer: Gated Axial-Attention for Medical Image Segmentation" - MICCAI 2021
☆861Feb 23, 2023Updated 3 years ago
jacobgil / pytorch-grad-cam
View on GitHub
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, I…
☆12,922Jul 10, 2026Updated last week
xmu-xiaoma666 / External-Attention-pytorch
View on GitHub
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.…
☆12,182Mar 16, 2026Updated 4 months ago
facebookresearch / deit
View on GitHub
Official DeiT repository
☆4,359Mar 15, 2024Updated 2 years ago
lucidrains / halonet-pytorch
View on GitHub
Implementation of the 😇 Attention layer from the paper, Scaling Local Self-Attention For Parameter Efficient Visual Backbones
☆199Mar 24, 2021Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
DirtyHarryLYL / Transformer-in-Vision
View on GitHub
Recent Transformer-based CV and related works.
☆1,345Aug 22, 2023Updated 2 years ago
JunMa11 / SegLossOdyssey
View on GitHub
A collection of loss functions for medical image segmentation
☆4,010Nov 1, 2023Updated 2 years ago
csrhddlam / axial-deeplab
View on GitHub
This is a PyTorch re-implementation of Axial-DeepLab (ECCV 2020 Spotlight)
☆459Jun 22, 2021Updated 5 years ago
MenghaoGuo / Awesome-Vision-Attentions
View on GitHub
Summary of related papers on visual attention. Related code will be released based on Jittor gradually.
☆2,839Oct 20, 2024Updated last year
whai362 / PVT
View on GitHub
Official implementation of PVT series
☆1,902Oct 27, 2022Updated 3 years ago
arogozhnikov / einops
View on GitHub
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
☆9,554Jul 5, 2026Updated 2 weeks ago
google-research / vision_transformer
View on GitHub
☆12,635Jul 9, 2026Updated last week
facebookresearch / ConvNeXt
View on GitHub
Code release for ConvNeXt model
☆6,414Jan 8, 2023Updated 3 years ago
KevinMusgrave / pytorch-metric-learning
View on GitHub
The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
☆6,333Aug 17, 2025Updated 11 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
leaderj1001 / Stand-Alone-Self-Attention
View on GitHub
Implementing Stand-Alone Self-Attention in Vision Models using Pytorch
☆457Feb 13, 2020Updated 6 years ago
jeonsworld / ViT-pytorch
View on GitHub
Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
☆2,158Jun 7, 2022Updated 4 years ago
The-AI-Summer / learn-deep-learning
View on GitHub
AI Summer's complete catalog of articles
☆113Dec 30, 2021Updated 4 years ago
lukemelas / EfficientNet-PyTorch
View on GitHub
A PyTorch implementation of EfficientNet
☆8,222Apr 8, 2022Updated 4 years ago
epfml / attention-cnn
View on GitHub
Source code for "On the Relationship between Self-Attention and Convolutional Layers"
☆1,121Jan 10, 2023Updated 3 years ago
qubvel-org / segmentation_models.pytorch
View on GitHub
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
☆11,664Updated this week
lucidrains / transformer-in-transformer
View on GitHub
Implementation of Transformer in Transformer, pixel level attention paired with patch level attention for image classification, in Pytorc…
☆306Dec 27, 2021Updated 4 years ago
The-AI-Summer / deep-learning-visuals
View on GitHub
A collection of 100 Deep Learning images and visualizations
☆80Jul 13, 2021Updated 5 years ago
utkuozbulak / pytorch-cnn-visualizations
View on GitHub
Pytorch implementation of convolutional neural network visualization techniques
☆8,222Jan 1, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
facebookresearch / pytorchvideo
View on GitHub
A deep learning library for video understanding research.
☆3,565May 5, 2026Updated 2 months ago
facebookresearch / xcit
View on GitHub
Official code Cross-Covariance Image Transformer (XCiT)
☆681Sep 28, 2021Updated 4 years ago
d-li14 / involution
View on GitHub
[CVPR 2021] Involution: Inverting the Inherence of Convolution for Visual Recognition, a brand new neural operator
☆1,311Jul 16, 2021Updated 5 years ago
lucidrains / lambda-networks
View on GitHub
Implementation of LambdaNetworks, a new approach to image recognition that reaches SOTA with less compute
☆1,528Nov 18, 2020Updated 5 years ago
alohays / awesome-visual-representation-learning-with-transformers
View on GitHub
Awesome Transformers (self-attention) in Computer Vision
☆271Jul 31, 2021Updated 4 years ago
kornia / kornia
View on GitHub
🐍 Geometric Computer Vision Library for Spatial AI
☆11,282Updated this week
facebookresearch / vissl
View on GitHub
VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.
☆3,295Mar 3, 2024Updated 2 years ago