sail-sg / voloView external linksLinks
VOLO: Vision Outlooker for Visual Recognition
☆949Sep 18, 2022Updated 3 years ago
Alternatives and similar repositories for volo
Users that are interested in volo are comparing it to the libraries listed below
Sorting:
- MLP-Like Vision Permutator for Visual Recognition (PyTorch)☆192Mar 31, 2022Updated 3 years ago
- Pytorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers"☆433Sep 5, 2023Updated 2 years ago
- ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet☆1,191Oct 27, 2023Updated 2 years ago
- Official code Cross-Covariance Image Transformer (XCiT)☆674Sep 28, 2021Updated 4 years ago
- Official DeiT repository☆4,323Mar 15, 2024Updated last year
- Scale-aware Automatic Augmentation for Object Detection (CVPR 2021)☆199Aug 24, 2022Updated 3 years ago
- MEAL V2: Boosting Vanilla ResNet-50 to 80%+ Top-1 Accuracy on ImageNet without Tricks. In NeurIPS 2020 workshop.☆701Dec 24, 2021Updated 4 years ago
- Official implementation of PVT series☆1,882Oct 27, 2022Updated 3 years ago
- ☆110Sep 15, 2021Updated 4 years ago
- [CVPR2021, PAMI2023] End-to-End Object Detection with Learnable Proposal☆1,348Apr 30, 2023Updated 2 years ago
- Implementation of LambdaNetworks, a new approach to image recognition that reaches SOTA with less compute☆1,532Nov 18, 2020Updated 5 years ago
- RepVGG: Making VGG-style ConvNets Great Again☆3,453Feb 10, 2023Updated 3 years ago
- [ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"☆291Apr 25, 2022Updated 3 years ago
- This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".☆292Sep 28, 2022Updated 3 years ago
- This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".☆15,709Jul 24, 2024Updated last year
- ResNeSt: Split-Attention Networks☆3,265Dec 9, 2022Updated 3 years ago
- [CVPR 2021] Involution: Inverting the Inherence of Convolution for Visual Recognition, a brand new neural operator☆1,317Jul 16, 2021Updated 4 years ago
- Two-stage CenterNet☆1,222Nov 20, 2022Updated 3 years ago
- PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)☆1,365Jun 1, 2024Updated last year
- Code release for ConvNeXt model☆6,293Jan 8, 2023Updated 3 years ago
- This is an official implementation for "Contextual Transformer Networks for Visual Recognition".☆538Aug 8, 2021Updated 4 years ago
- [NeurIPS 2021] You Only Look at One Sequence☆906May 4, 2022Updated 3 years ago
- Code for the Convolutional Vision Transformer (ConViT)☆472Oct 25, 2021Updated 4 years ago
- RepMLPNet: Hierarchical Vision MLP with Re-parameterized Locality (CVPR 2022)☆307Feb 10, 2023Updated 3 years ago
- Official Pytorch implementation of ReXNet (Rank eXpansion Network) with pretrained models☆452Jan 30, 2022Updated 4 years ago
- Per-Pixel Classification is Not All You Need for Semantic Segmentation (NeurIPS 2021, spotlight)☆1,450Mar 11, 2022Updated 3 years ago
- This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".☆1,023Sep 29, 2022Updated 3 years ago
- (ICCV 2021 Oral) CoaT: Co-Scale Conv-Attentional Image Transformers☆236Feb 3, 2022Updated 4 years ago
- PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO☆7,443Jul 3, 2024Updated last year
- [ICML2021] What Makes for End-to-End Object Detection☆645Apr 30, 2023Updated 2 years ago
- [Preprint] ConvMLP: Hierarchical Convolutional MLPs for Vision, 2021☆167Oct 11, 2022Updated 3 years ago
- Two simple and effective designs of vision transformer, which is on par with the Swin transformer☆608Feb 14, 2023Updated 3 years ago
- DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous Convolution☆1,148Dec 14, 2021Updated 4 years ago
- ☆650Nov 28, 2022Updated 3 years ago
- [NeurIPS 2021] [T-PAMI] DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification☆648Jul 11, 2023Updated 2 years ago
- (ICCV 2021) BossNAS: Exploring Hybrid CNN-transformers with Block-wisely Self-supervised Neural Architecture Search☆142Dec 6, 2021Updated 4 years ago
- A deep learning library for video understanding research.☆3,544Jan 12, 2026Updated last month
- Official Pytorch Implementation of: "ImageNet-21K Pretraining for the Masses"(NeurIPS, 2021) paper☆779Jan 11, 2023Updated 3 years ago
- Implementation of Bottleneck Transformer in Pytorch☆677Sep 20, 2021Updated 4 years ago