[CVPR2022 - Oral] Official Jax Implementation of Learned Queries for Efficient Local Attention
☆118Apr 19, 2022Updated 3 years ago
Alternatives and similar repositories for qna
Users who are interested in qna are comparing it to the libraries listed below
- [ACCV2022 (Oral)] Efficient Hardware-aware Neural Architecture Search for Image Super-resolution on Mobile Devices☆18Oct 5, 2022Updated 3 years ago
- [ECCV 2022] Official repository for "MaxViT: Multi-Axis Vision Transformer". SOTA foundation models for classification, detection, segmen…☆488Jun 2, 2023Updated 2 years ago
- [ICLR 2022] "As-ViT: Auto-scaling Vision Transformers without Training" by Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wa…☆76Feb 21, 2022Updated 4 years ago
- Pytorch implementation of Mix-Shifting-MLP (MS-MLP)☆16Feb 16, 2022Updated 4 years ago
- ☆214Dec 17, 2021Updated 4 years ago
- Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022☆1,175May 15, 2024Updated last year
- Official implementation of "Clustering as Attention: Unified Image Segmentation with Hierarchical Clustering"☆32Jun 16, 2022Updated 3 years ago
- Open-source code for the research work published on arXiv: https://arxiv.org/abs/2106.02689☆55Feb 14, 2022Updated 4 years ago
- [AAAI 2022] This is the official PyTorch implementation of "Less is More: Pay Less Attention in Vision Transformers"☆97Jun 19, 2022Updated 3 years ago
- Lite Vision Transformer (CVPR 2022)☆144Oct 21, 2022Updated 3 years ago
- PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)☆1,367Jun 1, 2024Updated last year
- Featurized Query R-CNN☆45Jun 17, 2022Updated 3 years ago
- Official implementation for "SimA: Simple Softmax-free Attention for Vision Transformers"☆46Apr 18, 2024Updated last year
- Implementation of Hire-MLP: Vision MLP via Hierarchical Rearrangement and An Image Patch is a Wave: Phase-Aware Vision MLP.☆37Oct 15, 2022Updated 3 years ago
- MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens (CVPR 2022)☆80Oct 20, 2022Updated 3 years ago
- This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".☆292Sep 28, 2022Updated 3 years ago
- Lightweight Transformer for Multi-modal Tasks☆16Dec 9, 2022Updated 3 years ago
- ☆16Jul 7, 2023Updated 2 years ago
- [NeurIPS 2021 Spotlight] Official code for "Focal Self-attention for Local-Global Interactions in Vision Transformers"☆555Mar 27, 2022Updated 3 years ago
- SOIT: Segmenting Objects with Instance-Aware Transformers☆14Jun 6, 2022Updated 3 years ago
- CMT Pytorch implementation of our CVPR 2022 paper CMT: Convolutional Neural Networks Meet Vision Transformers (https://arxiv.org/pdf/2107…☆103Jul 1, 2022Updated 3 years ago
- ☆50Jan 23, 2022Updated 4 years ago
- Contrastive Object-level Pre-training with Spatial Noise Curriculum Learning☆20Feb 4, 2022Updated 4 years ago
- ☆31Mar 14, 2022Updated 3 years ago
- TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation, CVPR2022☆403Oct 27, 2022Updated 3 years ago
- [ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"☆291Apr 25, 2022Updated 3 years ago
- Implementation of the paper ''Implicit Feature Refinement for Instance Segmentation''.☆20Oct 27, 2021Updated 4 years ago
- Official Jax Implementation of MaskGIT☆554Nov 18, 2022Updated 3 years ago
- Official code for Cross-Covariance Image Transformer (XCiT)☆674Sep 28, 2021Updated 4 years ago
- The official code for the paper: https://openreview.net/forum?id=_PHymLIxuI☆401Jan 14, 2024Updated 2 years ago
- Official PyTorch implementation of the paper: "Solving ImageNet: a Unified Scheme for Training any Backbone to Top Results" (2022)☆193Jan 11, 2023Updated 3 years ago
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆29Jan 23, 2024Updated 2 years ago
- ☆249Mar 16, 2022Updated 3 years ago
- [CVPR 2022] Unofficial repository for "MAXIM: Multi-Axis MLP for Image Processing". Official repo: https://github.com/google-research/max…☆20Mar 31, 2022Updated 3 years ago
- QuadTree Attention for Vision Transformers (ICLR2022)☆364Apr 23, 2024Updated last year
- Official implementation of the paper Visual Parser: Representing Part-whole Hierarchies with Transformers☆121Aug 12, 2021Updated 4 years ago
- Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformer with Deformable Atte…☆926Apr 17, 2024Updated last year
- Official repository of ACmix (CVPR2022)☆410Apr 25, 2022Updated 3 years ago
- official code for dynamic convolution decomposition☆133Nov 22, 2021Updated 4 years ago