linhezheng19/CAT

linhezheng19 / CAT

Official implementation of "CAT: Cross Attention in Vision Transformer".

☆169

Alternatives and similar repositories for CAT

Users that are interested in CAT are comparing it to the libraries listed below.

mulinmeng / Shuffle-Transformer
☆98 · Apr 27, 2022 · Updated 4 years ago
yun-liu / HAT-Net
Vision Transformers with Hierarchical Attention
☆103 · Sep 11, 2025 · Updated 10 months ago
zhubinQAQ / CPM-R-CNN
CPM R-CNN: Calibrating Point-guided Misalignment in Object Detection
☆10 · Nov 2, 2020 · Updated 5 years ago
starmemda / MlTr
☆43 · Jun 15, 2021 · Updated 5 years ago
IBM / CrossViT
Official implementation of CrossViT. https://arxiv.org/abs/2103.14899
☆417 · Jan 12, 2022 · Updated 4 years ago
wofmanaf / ResT
This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".
☆291 · Sep 28, 2022 · Updated 3 years ago
Hxyou / MSCLIP
Official Code of ECCV 2022 paper MS-CLIP
☆91 · Jul 27, 2022 · Updated 4 years ago
zhoudaquan / Refiner_ViT
☆110 · Sep 15, 2021 · Updated 4 years ago
whai362 / PVT
Official implementation of PVT series
☆1,902 · Oct 27, 2022 · Updated 3 years ago
mahaoyuHKU / pytorch-boat
This is an unofficial implementation of BOAT: Bilateral Local Attention Vision Transformer
☆55 · Mar 29, 2022 · Updated 4 years ago
jiangtaoxie / SoT
SoT: Delving Deeper into Classification Head for Transformer
☆50 · Dec 24, 2021 · Updated 4 years ago
ofsoundof / LocalViT
☆118 · Jan 17, 2026 · Updated 6 months ago
Meituan-AutoML / Twins
Two simple and effective designs of vision transformer, which is on par with the Swin transformer
☆611 · Feb 14, 2023 · Updated 3 years ago
CASIA-LMC-Lab / DPT
DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM2021)
☆158 · Aug 18, 2021 · Updated 4 years ago
junfu1115 / DRAN
Scene Segmentation with Dual Relation-aware Attention Network (TNNLS2020)
☆42 · Oct 21, 2020 · Updated 5 years ago
yuexy / PS-ViT
Official implementation of the paper Vision Transformer with Progressive Sampling, ICCV 2021.
☆153 · Jan 14, 2022 · Updated 4 years ago
hkzhang-git / ParC-Net
[ECCV 2022] Source code of "EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers"
☆359 · Dec 14, 2022 · Updated 3 years ago
yf19970118 / OPLD-Pytorch
Official implementation of Learning Point-guided Localization for Detection in Remote Sensing Images
☆28 · Jul 12, 2021 · Updated 5 years ago
megvii-model / WeightNet
☆172 · Aug 7, 2020 · Updated 5 years ago
blackfeather-wang / Dynamic-Vision-Transformer
Accelerating T2t-ViT by 1.6-3.6x.
☆260 · Nov 25, 2021 · Updated 4 years ago
cheerss / CrossFormer
The official code for the paper: https://openreview.net/forum?id=_PHymLIxuI
☆403 · Jan 14, 2024 · Updated 2 years ago
rishikksh20 / CrossViT-pytorch
Implementation of CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
☆208 · Apr 7, 2021 · Updated 5 years ago
svip-lab / AS-MLP
[ICLR'22] This is an official implementation for "AS-MLP: An Axial Shifted MLP Architecture for Vision".
☆127 · Oct 15, 2022 · Updated 3 years ago
facebookresearch / LeViT
LeViT a Vision Transformer in ConvNet's Clothing for Faster Inference
☆624 · Aug 27, 2022 · Updated 3 years ago
OliverRensu / Shunted-Transformer
☆216 · Dec 17, 2021 · Updated 4 years ago
xiusu / ViTAS
Code for ViTAS_Vision Transformer Architecture Search
☆50 · Jul 22, 2021 · Updated 5 years ago
sail-sg / volo
VOLO: Vision Outlooker for Visual Recognition
☆948 · Sep 18, 2022 · Updated 3 years ago
szq0214 / SReT
Official PyTorch implementation of our ECCV 2022 paper "Sliced Recursive Transformer"
☆66 · Sep 6, 2022 · Updated 3 years ago
JDAI-CV / CoTNet
This is an official implementation for "Contextual Transformer Networks for Visual Recognition".
☆538 · Aug 8, 2021 · Updated 4 years ago
ShoufaChen / CycleMLP
[ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"
☆290 · Apr 25, 2022 · Updated 4 years ago
Meituan-AutoML / CPVT
☆196 · Feb 14, 2023 · Updated 3 years ago
kevin-ssy / ViP
Official implementation of the paper Visual Parser: Representing Part-whole Hierarchies with Transformers
☆120 · Aug 12, 2021 · Updated 4 years ago
yuhuan-wu / P2T
[IEEE TPAMI'23] Pyramid Pooling Transformer for Scene Understanding
☆221 · Jun 16, 2025 · Updated last year
moabarar / qna
[CVPR2022 - Oral] Official Jax Implementation of Learned Queries for Efficient Local Attention
☆119 · Apr 19, 2022 · Updated 4 years ago
duzw9311 / LDA-AQU
[MM2024] LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention
☆13 · Dec 24, 2024 · Updated last year
YehLi / ImageNetModel
Official ImageNet Model repository
☆274 · May 5, 2023 · Updated 3 years ago
microsoft / CSWin-Transformer
CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped, CVPR 2022
☆586 · Nov 1, 2023 · Updated 2 years ago
kkkls / EVSSM
[CVPR 2025] Efficient Visual State Space Model for Image Deblurring; 1st place on AIM 2025 challenge on High FPS Motion Deblurring:
☆155 · Jul 3, 2025 · Updated last year
mv-lab / ViT-FGVC8
"Exploring Vision Transformers for Fine-grained Classification" at CVPRW FGVC8
☆30 · Jun 25, 2021 · Updated 5 years ago