[ICML 2023] Official PyTorch implementation of Global Context Vision Transformers
☆446 · Dec 22, 2023 · Updated 2 years ago
Alternatives and similar repositories for GCVit
Users who are interested in GCVit are comparing it to the libraries listed below.
- Neighborhood Attention Transformer, arXiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arXiv 2022 ☆1,175 · May 15, 2024 · Updated last year
- [ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention ☆906 · Jul 22, 2025 · Updated 7 months ago
- A method to increase the speed and lower the memory footprint of existing vision transformers. ☆1,171 · Jun 17, 2024 · Updated last year
- EfficientFormerV2 [ICCV 2023] & EfficientFormer [NeurIPS 2022] ☆1,109 · Aug 13, 2023 · Updated 2 years ago
- [NeurIPS 2022] Official code for "Focal Modulation Networks" ☆751 · Nov 7, 2023 · Updated 2 years ago
- Official PyTorch implementation of the paper: "Solving ImageNet: a Unified Scheme for Training any Backbone to Top Results" (2022) ☆193 · Jan 11, 2023 · Updated 3 years ago
- Official PyTorch implementation of Fully Attentional Networks ☆482 · Mar 31, 2023 · Updated 2 years ago
- [ICLR 2023] "More ConvNets in the 2020s: Scaling up Kernels Beyond 51x51 using Sparsity"; [ICML 2023] "Are Large Kernels Better Teachers… ☆284 · Jul 5, 2023 · Updated 2 years ago
- PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral) ☆1,367 · Jun 1, 2024 · Updated last year
- Official DeiT repository ☆4,325 · Mar 15, 2024 · Updated last year
- This is an official PyTorch/GPU implementation of SupMAE. ☆80 · Aug 30, 2022 · Updated 3 years ago
- Official code for the paper: "Metadata Archaeology" ☆19 · May 10, 2023 · Updated 2 years ago
- Code release for ConvNeXt V2 model ☆1,975 · Aug 14, 2024 · Updated last year
- Code release of research paper "Exploring Long-Sequence Masked Autoencoders" ☆100 · Oct 14, 2022 · Updated 3 years ago
- [NeurIPS 2021 Spotlight] Official code for "Focal Self-attention for Local-Global Interactions in Vision Transformers" ☆556 · Mar 27, 2022 · Updated 3 years ago
- Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges from Text Supervision, CVPR 2022. ☆783 · May 10, 2022 · Updated 3 years ago
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "Fast Vision Transformers with HiLo Attention" ☆297 · Nov 17, 2023 · Updated 2 years ago
- Code release for ConvNeXt model ☆6,300 · Jan 8, 2023 · Updated 3 years ago
- [ICLR 2022] Official implementation of UniFormer ☆896 · Mar 29, 2024 · Updated last year
- MultiMAE: Multi-modal Multi-task Masked Autoencoders, ECCV 2022 ☆615 · Dec 13, 2022 · Updated 3 years ago
- Official implementation of PVT series ☆1,887 · Oct 27, 2022 · Updated 3 years ago
- [ECCV 2022] Official repository for "MaxViT: Multi-Axis Vision Transformer". SOTA foundation models for classification, detection, segmen… ☆488 · Jun 2, 2023 · Updated 2 years ago
- CVNets: A library for training computer vision networks ☆1,967 · Oct 30, 2023 · Updated 2 years ago
- (ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?" ☆823 · Jul 14, 2022 · Updated 3 years ago
- [CADL'22, ECCVW] Official repository of paper titled "EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Ap… ☆411 · Jul 25, 2023 · Updated 2 years ago
- ☆817 · Jul 30, 2022 · Updated 3 years ago
- [ICCV 2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788) ☆2,243 · Dec 22, 2022 · Updated 3 years ago
- [CVPR 2022] MPViT: Multi-Path Vision Transformer for Dense Prediction ☆389 · Mar 2, 2022 · Updated 4 years ago
- ConvMAE: Masked Convolution Meets Masked Autoencoders ☆524 · Mar 14, 2023 · Updated 2 years ago
- An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites ☆5,016 · Jul 30, 2024 · Updated last year
- [ICCV - 2023] Official repository of paper SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applic… ☆311 · Jul 18, 2025 · Updated 7 months ago
- [CVPR 2022] StyleSwin: Transformer-based GAN for High-resolution Image Generation ☆538 · Jul 30, 2024 · Updated last year
- [CVPR 2023] OneFormer: One Transformer to Rule Universal Image Segmentation ☆1,703 · Oct 3, 2024 · Updated last year
- Official PyTorch implementations for "SegNeXt: Rethinking Convolutional Attention Design for Semantic Segmentation" (NeurIPS 2022) ☆869 · Nov 22, 2022 · Updated 3 years ago
- Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab. ☆4,385 · Mar 15, 2025 · Updated 11 months ago
- CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped Windows, CVPR 2022 ☆589 · Nov 1, 2023 · Updated 2 years ago
- Learning Features with Parameter-Free Layers, ICLR 2022 ☆84 · May 3, 2023 · Updated 2 years ago
- ASSET: Autoregressive Semantic Scene Editing with Transformers at High Resolutions (SIGGRAPH 2022 - Journal Track) ☆112 · May 25, 2022 · Updated 3 years ago
- FFCV: Fast Forward Computer Vision (and other ML workloads!) ☆2,985 · Jun 16, 2024 · Updated last year