The official code for the paper: https://openreview.net/forum?id=_PHymLIxuI
☆401 · Jan 14, 2024 · Updated 2 years ago
Alternatives and similar repositories for CrossFormer
Users that are interested in CrossFormer are comparing it to the libraries listed below
- Two simple and effective designs of vision transformer, which are on par with the Swin transformer ☆608 · Feb 14, 2023 · Updated 3 years ago
- [NeurIPS 2021 Spotlight] Official code for "Focal Self-attention for Local-Global Interactions in Vision Transformers" ☆556 · Mar 27, 2022 · Updated 3 years ago
- CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped, CVPR 2022 ☆589 · Nov 1, 2023 · Updated 2 years ago
- ☆214 · Dec 17, 2021 · Updated 4 years ago
- Official implementation of PVT series ☆1,887 · Oct 27, 2022 · Updated 3 years ago
- This is an official implementation for "Contextual Transformer Networks for Visual Recognition". ☆538 · Aug 8, 2021 · Updated 4 years ago
- Official code for paper "On the Connection between Local Attention and Dynamic Depth-wise Convolution" ICLR 2022 Spotlight ☆185 · Nov 17, 2022 · Updated 3 years ago
- [ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction" ☆291 · Apr 25, 2022 · Updated 3 years ago
- ☆31 · Dec 20, 2022 · Updated 3 years ago
- A simple, fast, efficient and end-to-end 3D object detector without NMS. ☆30 · Nov 30, 2021 · Updated 4 years ago
- VOLO: Vision Outlooker for Visual Recognition ☆949 · Sep 18, 2022 · Updated 3 years ago
- [NeurIPS2021] Code Release of K-Net: Towards Unified Image Segmentation ☆485 · Dec 16, 2021 · Updated 4 years ago
- ☆98 · Apr 27, 2022 · Updated 3 years ago
- Official implementation of the paper Vision Transformer with Progressive Sampling, ICCV 2021. ☆153 · Jan 14, 2022 · Updated 4 years ago
- ☆110 · Sep 15, 2021 · Updated 4 years ago
- Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022 ☆1,174 · May 15, 2024 · Updated last year
- "SOLQ: Segmenting Objects by Learning Queries", SOLQ is an end-to-end instance segmentation framework with Transformer. ☆200 · Apr 17, 2022 · Updated 3 years ago
- PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral) ☆1,367 · Jun 1, 2024 · Updated last year
- [ECCV-20] Official PyTorch implementation of HoughNet, a voting-based object detector. ☆177 · Oct 15, 2022 · Updated 3 years ago
- Open-source code for the research work published on arXiv: https://arxiv.org/abs/2106.02689 ☆54 · Feb 14, 2022 · Updated 4 years ago
- DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM2021) ☆158 · Aug 18, 2021 · Updated 4 years ago
- Official implementation of CrossViT. https://arxiv.org/abs/2103.14899 ☆416 · Jan 12, 2022 · Updated 4 years ago
- SMCA replication ☆21 · Jul 24, 2021 · Updated 4 years ago
- [ICLR2022] official implementation of UniFormer ☆896 · Mar 29, 2024 · Updated last year
- ConvMAE: Masked Convolution Meets Masked Autoencoders ☆524 · Mar 14, 2023 · Updated 2 years ago
- ☆57 · Jan 17, 2022 · Updated 4 years ago
- Official implementation of the paper Visual Parser: Representing Part-whole Hierarchies with Transformers ☆121 · Aug 12, 2021 · Updated 4 years ago
- Official code Cross-Covariance Image Transformer (XCiT) ☆674 · Sep 28, 2021 · Updated 4 years ago
- Open-source code for Generic Grouping Network (GGN, CVPR 2022) ☆114 · Jan 28, 2026 · Updated last month
- Accelerating T2T-ViT by 1.6-3.6x. ☆258 · Nov 25, 2021 · Updated 4 years ago
- This is a collection of our NAS and Vision Transformer work. ☆1,823 · Jul 25, 2024 · Updated last year
- ☆650 · Nov 28, 2022 · Updated 3 years ago
- PyTorch Implementation of Region Similarity Representation Learning (ReSim) ☆89 · Jul 27, 2021 · Updated 4 years ago
- Official PyTorch implementation of our ECCV 2022 paper "Sliced Recursive Transformer" ☆66 · Sep 6, 2022 · Updated 3 years ago
- ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet ☆1,192 · Oct 27, 2023 · Updated 2 years ago
- ICCV2021 (Oral) - Exploring Cross-Image Pixel Contrast for Semantic Segmentation ☆689 · Oct 13, 2022 · Updated 3 years ago
- This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition". ☆292 · Sep 28, 2022 · Updated 3 years ago
- [Preprint] ConvMLP: Hierarchical Convolutional MLPs for Vision, 2021 ☆167 · Oct 11, 2022 · Updated 3 years ago
- Per-Pixel Classification is Not All You Need for Semantic Segmentation (NeurIPS 2021, spotlight) ☆1,450 · Mar 11, 2022 · Updated 3 years ago