Official implement of "CAT: Cross Attention in Vision Transformer".
☆169Jun 25, 2022Updated 3 years ago
Alternatives and similar repositories for CAT
Users that are interested in CAT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆98Apr 27, 2022Updated 4 years ago
- Vision Transformers with Hierarchical Attention☆103Sep 11, 2025Updated 9 months ago
- CPM R-CNN: Calibrating Point-guided Misalignment in Object Detection☆10Nov 2, 2020Updated 5 years ago
- ☆43Jun 15, 2021Updated 5 years ago
- Official implementation of CrossViT. https://arxiv.org/abs/2103.14899☆419Jan 12, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".☆292Sep 28, 2022Updated 3 years ago
- Official Code of ECCV 2022 paper MS-CLIP☆91Jul 27, 2022Updated 3 years ago
- ☆110Sep 15, 2021Updated 4 years ago
- Official implementation of PVT series☆1,898Oct 27, 2022Updated 3 years ago
- This is an unofficial implementation of BOAT: Bilateral Local Attention Vision Transformer☆55Mar 29, 2022Updated 4 years ago
- ☆117Jan 17, 2026Updated 5 months ago
- SoT: Delving Deeper into Classification Head for Transformer☆50Dec 24, 2021Updated 4 years ago
- Two simple and effective designs of vision transformer, which is on par with the Swin transformer☆610Feb 14, 2023Updated 3 years ago
- Scene Segmentation with Dual Relation-aware Attention Network (TNNLS2020)☆42Oct 21, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM2021)☆158Aug 18, 2021Updated 4 years ago
- Official implementation of the paper Vision Transformer with Progressive Sampling, ICCV 2021.☆153Jan 14, 2022Updated 4 years ago
- [ECCV 2022] Source code of "EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers"☆357Dec 14, 2022Updated 3 years ago
- Official implementation of Learning Point-guided Localization for Detection in Remote Sensing Images☆28Jul 12, 2021Updated 4 years ago
- ☆172Aug 7, 2020Updated 5 years ago
- Accelerating T2t-ViT by 1.6-3.6x.☆260Nov 25, 2021Updated 4 years ago
- Implementation of CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification☆208Apr 7, 2021Updated 5 years ago
- The official code for the paper: https://openreview.net/forum?id=_PHymLIxuI☆403Jan 14, 2024Updated 2 years ago
- ☆215Dec 17, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Official PyTorch implementation of our ECCV 2022 paper "Sliced Recursive Transformer"☆66Sep 6, 2022Updated 3 years ago
- Code for ViTAS_Vision Transformer Architecture Search☆50Jul 22, 2021Updated 4 years ago
- VOLO: Vision Outlooker for Visual Recognition☆948Sep 18, 2022Updated 3 years ago
- This is an official implementation for "Contextual Transformer Networks for Visual Recognition".☆538Aug 8, 2021Updated 4 years ago
- LeViT a Vision Transformer in ConvNet's Clothing for Faster Inference☆623Aug 27, 2022Updated 3 years ago
- [ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"☆290Apr 25, 2022Updated 4 years ago
- ☆19Nov 25, 2022Updated 3 years ago
- ☆196Feb 14, 2023Updated 3 years ago
- [MM2024] LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention☆13Dec 24, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official implementation of the paper Visual Parser: Representing Part-whole Hierarchies with Transformers☆120Aug 12, 2021Updated 4 years ago
- [IEEE TPAMI'23] Pyramid Pooling Transformer for Scene Understanding☆222Jun 16, 2025Updated last year
- [CVPR2022 - Oral] Official Jax Implementation of Learned Queries for Efficient Local Attention☆119Apr 19, 2022Updated 4 years ago
- Official ImageNet Model repository☆274May 5, 2023Updated 3 years ago
- CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped, CVPR 2022☆585Nov 1, 2023Updated 2 years ago
- "Exploring Vision Transformers for Fine-grained Classification" at CVPRW FGVC8☆30Jun 25, 2021Updated 4 years ago
- Recent Transformer-based CV and related works.☆1,343Aug 22, 2023Updated 2 years ago