Official implementation of "CAT: Cross Attention in Vision Transformer".
☆169Jun 25, 2022Updated 3 years ago
Alternatives and similar repositories for CAT
Users that are interested in CAT are comparing it to the libraries listed below.
- ☆98Apr 27, 2022Updated 4 years ago
- Vision Transformers with Hierarchical Attention☆103Sep 11, 2025Updated 8 months ago
- CPM R-CNN: Calibrating Point-guided Misalignment in Object Detection☆10Nov 2, 2020Updated 5 years ago
- ☆43Jun 15, 2021Updated 4 years ago
- Official implementation of CrossViT. https://arxiv.org/abs/2103.14899☆418Jan 12, 2022Updated 4 years ago
- This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".☆292Sep 28, 2022Updated 3 years ago
- Official Code of ECCV 2022 paper MS-CLIP☆91Jul 27, 2022Updated 3 years ago
- ☆110Sep 15, 2021Updated 4 years ago
- Official implementation of PVT series☆1,894Oct 27, 2022Updated 3 years ago
- This is an unofficial implementation of BOAT: Bilateral Local Attention Vision Transformer☆55Mar 29, 2022Updated 4 years ago
- ☆117Jan 17, 2026Updated 4 months ago
- SoT: Delving Deeper into Classification Head for Transformer☆50Dec 24, 2021Updated 4 years ago
- Two simple and effective designs of vision transformer, which are on par with the Swin transformer☆609Feb 14, 2023Updated 3 years ago
- Scene Segmentation with Dual Relation-aware Attention Network (TNNLS2020)☆42Oct 21, 2020Updated 5 years ago
- DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM2021)☆157Aug 18, 2021Updated 4 years ago
- Official implementation of the paper Vision Transformer with Progressive Sampling, ICCV 2021.☆153Jan 14, 2022Updated 4 years ago
- [ECCV 2022] Source code of "EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers"☆357Dec 14, 2022Updated 3 years ago
- Official implementation of Learning Point-guided Localization for Detection in Remote Sensing Images☆28Jul 12, 2021Updated 4 years ago
- ☆172Aug 7, 2020Updated 5 years ago
- Accelerating T2t-ViT by 1.6-3.6x.☆260Nov 25, 2021Updated 4 years ago
- Implementation of CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification☆208Apr 7, 2021Updated 5 years ago
- The official code for the paper: https://openreview.net/forum?id=_PHymLIxuI☆402Jan 14, 2024Updated 2 years ago
- [ICLR'22] This is an official implementation for "AS-MLP: An Axial Shifted MLP Architecture for Vision".☆127Oct 15, 2022Updated 3 years ago
- ☆215Dec 17, 2021Updated 4 years ago
- Official PyTorch implementation of our ECCV 2022 paper "Sliced Recursive Transformer"☆66Sep 6, 2022Updated 3 years ago
- Code for ViTAS_Vision Transformer Architecture Search☆51Jul 22, 2021Updated 4 years ago
- VOLO: Vision Outlooker for Visual Recognition☆946Sep 18, 2022Updated 3 years ago
- This is an official implementation for "Contextual Transformer Networks for Visual Recognition".☆538Aug 8, 2021Updated 4 years ago
- LeViT a Vision Transformer in ConvNet's Clothing for Faster Inference☆624Aug 27, 2022Updated 3 years ago
- [ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"☆290Apr 25, 2022Updated 4 years ago
- ☆19Nov 25, 2022Updated 3 years ago
- ☆196Feb 14, 2023Updated 3 years ago
- [MM2024] LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention☆13Dec 24, 2024Updated last year
- Official implementation of the paper Visual Parser: Representing Part-whole Hierarchies with Transformers☆120Aug 12, 2021Updated 4 years ago
- [IEEE TPAMI'23] Pyramid Pooling Transformer for Scene Understanding☆222Jun 16, 2025Updated 11 months ago
- [CVPR2022 - Oral] Official Jax Implementation of Learned Queries for Efficient Local Attention☆119Apr 19, 2022Updated 4 years ago
- ☆56Jun 13, 2022Updated 3 years ago
- Official ImageNet Model repository☆272May 5, 2023Updated 3 years ago
- CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped, CVPR 2022☆585Nov 1, 2023Updated 2 years ago