Official implementation of "CAT: Cross Attention in Vision Transformer".
☆169 Jun 25, 2022 Updated 3 years ago
Alternatives and similar repositories for CAT
Users that are interested in CAT are comparing it to the libraries listed below.
- ☆98 Apr 27, 2022 Updated 3 years ago
- Vision Transformers with Hierarchical Attention ☆103 Sep 11, 2025 Updated 6 months ago
- CPM R-CNN: Calibrating Point-guided Misalignment in Object Detection ☆10 Nov 2, 2020 Updated 5 years ago
- ☆43 Jun 15, 2021 Updated 4 years ago
- Official implementation of CrossViT. https://arxiv.org/abs/2103.14899 ☆414 Jan 12, 2022 Updated 4 years ago
- This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition". ☆292 Sep 28, 2022 Updated 3 years ago
- Implementation of Convolutional enhanced image Transformer ☆105 Mar 27, 2021 Updated 5 years ago
- Official Code of ECCV 2022 paper MS-CLIP ☆91 Jul 27, 2022 Updated 3 years ago
- ☆110 Sep 15, 2021 Updated 4 years ago
- Official implementation of PVT series ☆1,889 Oct 27, 2022 Updated 3 years ago
- This is an unofficial implementation of BOAT: Bilateral Local Attention Vision Transformer ☆55 Mar 29, 2022 Updated 4 years ago
- ☆117 Jan 17, 2026 Updated 2 months ago
- SoT: Delving Deeper into Classification Head for Transformer ☆50 Dec 24, 2021 Updated 4 years ago
- Two simple and effective designs of vision transformer, which is on par with the Swin transformer ☆608 Feb 14, 2023 Updated 3 years ago
- Scene Segmentation with Dual Relation-aware Attention Network (TNNLS2020) ☆42 Oct 21, 2020 Updated 5 years ago
- Official implementation of the paper Vision Transformer with Progressive Sampling, ICCV 2021. ☆153 Jan 14, 2022 Updated 4 years ago
- DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM2021) ☆157 Aug 18, 2021 Updated 4 years ago
- [ECCV 2022] Source code of "EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers" ☆357 Dec 14, 2022 Updated 3 years ago
- Official implementation of Learning Point-guided Localization for Detection in Remote Sensing Images ☆28 Jul 12, 2021 Updated 4 years ago
- ☆172 Aug 7, 2020 Updated 5 years ago
- Accelerating T2t-ViT by 1.6-3.6x. ☆259 Nov 25, 2021 Updated 4 years ago
- Implementation of CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification ☆208 Apr 7, 2021 Updated 4 years ago
- The official code for the paper: https://openreview.net/forum?id=_PHymLIxuI ☆401 Jan 14, 2024 Updated 2 years ago
- [ICLR'22] This is an official implementation for "AS-MLP: An Axial Shifted MLP Architecture for Vision". ☆127 Oct 15, 2022 Updated 3 years ago
- ☆214 Dec 17, 2021 Updated 4 years ago
- Official PyTorch implementation of our ECCV 2022 paper "Sliced Recursive Transformer" ☆66 Sep 6, 2022 Updated 3 years ago
- Code for ViTAS_Vision Transformer Architecture Search ☆51 Jul 22, 2021 Updated 4 years ago
- VOLO: Vision Outlooker for Visual Recognition ☆950 Sep 18, 2022 Updated 3 years ago
- This is an official implementation for "Contextual Transformer Networks for Visual Recognition". ☆540 Aug 8, 2021 Updated 4 years ago
- LeViT a Vision Transformer in ConvNet's Clothing for Faster Inference ☆624 Aug 27, 2022 Updated 3 years ago
- [ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction" ☆292 Apr 25, 2022 Updated 3 years ago
- ☆19 Nov 25, 2022 Updated 3 years ago
- ☆196 Feb 14, 2023 Updated 3 years ago
- [MM2024] LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention ☆12 Dec 24, 2024 Updated last year
- Official implementation of the paper Visual Parser: Representing Part-whole Hierarchies with Transformers ☆121 Aug 12, 2021 Updated 4 years ago
- [IEEE TPAMI'23] Pyramid Pooling Transformer for Scene Understanding ☆221 Jun 16, 2025 Updated 9 months ago
- [CVPR2022 - Oral] Official Jax Implementation of Learned Queries for Efficient Local Attention ☆119 Apr 19, 2022 Updated 3 years ago
- ☆57 Jun 13, 2022 Updated 3 years ago
- Official ImageNet Model repository ☆264 May 5, 2023 Updated 2 years ago