Official implementation of "CAT: Cross Attention in Vision Transformer".
☆169Jun 25, 2022Updated 3 years ago
Alternatives and similar repositories for CAT
Users that are interested in CAT are comparing it to the libraries listed below.
- ☆98Apr 27, 2022Updated 3 years ago
- Vision Transformers with Hierarchical Attention☆103Sep 11, 2025Updated 7 months ago
- CPM R-CNN: Calibrating Point-guided Misalignment in Object Detection☆10Nov 2, 2020Updated 5 years ago
- ☆43Jun 15, 2021Updated 4 years ago
- Official implementation of CrossViT. https://arxiv.org/abs/2103.14899☆416Jan 12, 2022Updated 4 years ago
- This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".☆292Sep 28, 2022Updated 3 years ago
- Implementation of Convolutional enhanced image Transformer☆105Mar 27, 2021Updated 5 years ago
- Official Code of ECCV 2022 paper MS-CLIP☆91Jul 27, 2022Updated 3 years ago
- ☆110Sep 15, 2021Updated 4 years ago
- Official implementation of PVT series☆1,890Oct 27, 2022Updated 3 years ago
- This is an unofficial implementation of BOAT: Bilateral Local Attention Vision Transformer☆55Mar 29, 2022Updated 4 years ago
- ☆117Jan 17, 2026Updated 3 months ago
- SoT: Delving Deeper into Classification Head for Transformer☆50Dec 24, 2021Updated 4 years ago
- Two simple and effective designs of vision transformer, which are on par with the Swin transformer☆609Feb 14, 2023Updated 3 years ago
- Scene Segmentation with Dual Relation-aware Attention Network (TNNLS2020)☆42Oct 21, 2020Updated 5 years ago
- DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM2021)☆157Aug 18, 2021Updated 4 years ago
- Official implementation of the paper Vision Transformer with Progressive Sampling, ICCV 2021.☆153Jan 14, 2022Updated 4 years ago
- [ECCV 2022] Source code of "EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers"☆357Dec 14, 2022Updated 3 years ago
- Official implementation of Learning Point-guided Localization for Detection in Remote Sensing Images☆28Jul 12, 2021Updated 4 years ago
- ☆172Aug 7, 2020Updated 5 years ago
- Accelerating T2t-ViT by 1.6-3.6x.☆259Nov 25, 2021Updated 4 years ago
- Implementation of CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification☆208Apr 7, 2021Updated 5 years ago
- The official code for the paper: https://openreview.net/forum?id=_PHymLIxuI☆402Jan 14, 2024Updated 2 years ago
- [ICLR'22] This is an official implementation for "AS-MLP: An Axial Shifted MLP Architecture for Vision".☆127Oct 15, 2022Updated 3 years ago
- ☆214Dec 17, 2021Updated 4 years ago
- Official PyTorch implementation of our ECCV 2022 paper "Sliced Recursive Transformer"☆66Sep 6, 2022Updated 3 years ago
- Code for ViTAS_Vision Transformer Architecture Search☆51Jul 22, 2021Updated 4 years ago
- VOLO: Vision Outlooker for Visual Recognition☆950Sep 18, 2022Updated 3 years ago
- This is an official implementation for "Contextual Transformer Networks for Visual Recognition".☆539Aug 8, 2021Updated 4 years ago
- LeViT a Vision Transformer in ConvNet's Clothing for Faster Inference☆623Aug 27, 2022Updated 3 years ago
- [ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"☆290Apr 25, 2022Updated 3 years ago
- ☆19Nov 25, 2022Updated 3 years ago
- ☆196Feb 14, 2023Updated 3 years ago
- [MM2024] LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention☆13Dec 24, 2024Updated last year
- Official implementation of the paper Visual Parser: Representing Part-whole Hierarchies with Transformers☆121Aug 12, 2021Updated 4 years ago
- [IEEE TPAMI'23] Pyramid Pooling Transformer for Scene Understanding☆221Jun 16, 2025Updated 10 months ago
- [CVPR2022 - Oral] Official Jax Implementation of Learned Queries for Efficient Local Attention☆119Apr 19, 2022Updated 4 years ago
- ☆56Jun 13, 2022Updated 3 years ago
- Official ImageNet Model repository☆268May 5, 2023Updated 2 years ago