sail-sg/volo


VOLO: Vision Outlooker for Visual Recognition

☆948

Alternatives and similar repositories for volo

Users who are interested in volo are comparing it to the repositories listed below.

zihangJiang / TokenLabeling
Pytorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers"
☆436 · Sep 5, 2023 · Updated 2 years ago
houqb / VisionPermutator
MLP-Like Vision Permutator for Visual Recognition (PyTorch)
☆192 · Mar 31, 2022 · Updated 4 years ago
yitu-opensource / T2T-ViT
ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
☆1,194 · Oct 27, 2023 · Updated 2 years ago
zhoudaquan / Refiner_ViT
☆110 · Sep 15, 2021 · Updated 4 years ago
facebookresearch / deit
Official DeiT repository
☆4,359 · Mar 15, 2024 · Updated 2 years ago
whai362 / PVT
Official implementation of PVT series
☆1,902 · Oct 27, 2022 · Updated 3 years ago
facebookresearch / xcit
Official code Cross-Covariance Image Transformer (XCiT)
☆681 · Sep 28, 2021 · Updated 4 years ago
szq0214 / MEAL-V2
MEAL V2: Boosting Vanilla ResNet-50 to 80%+ Top-1 Accuracy on ImageNet without Tricks. In NeurIPS 2020 workshop.
☆701 · Dec 24, 2021 · Updated 4 years ago
JIA-Lab-research / SA-AutoAug
Scale-aware Automatic Augmentation for Object Detection (CVPR 2021)
☆198 · Aug 24, 2022 · Updated 3 years ago
ShoufaChen / CycleMLP
[ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"
☆290 · Apr 25, 2022 · Updated 4 years ago
microsoft / Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
☆16,010 · Jul 24, 2024 · Updated 2 years ago
lucidrains / lambda-networks
Implementation of LambdaNetworks, a new approach to image recognition that reaches SOTA with less compute
☆1,528 · Nov 18, 2020 · Updated 5 years ago
PeizeSun / SparseR-CNN
[CVPR2021, PAMI2023] End-to-End Object Detection with Learnable Proposal
☆1,345 · Apr 30, 2023 · Updated 3 years ago
DingXiaoH / RepVGG
RepVGG: Making VGG-style ConvNets Great Again
☆3,479 · Feb 10, 2023 · Updated 3 years ago
wofmanaf / ResT
This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".
☆291 · Sep 28, 2022 · Updated 3 years ago
facebookresearch / LeViT
LeViT a Vision Transformer in ConvNet's Clothing for Faster Inference
☆624 · Aug 27, 2022 · Updated 3 years ago
sail-sg / poolformer
PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)
☆1,363 · Jun 1, 2024 · Updated 2 years ago
zhanghang1989 / ResNeSt
ResNeSt: Split-Attention Networks
☆3,262 · Dec 9, 2022 · Updated 3 years ago
d-li14 / involution
[CVPR 2021] Involution: Inverting the Inherence of Convolution for Visual Recognition, a brand new neural operator
☆1,311 · Jul 16, 2021 · Updated 5 years ago
facebookresearch / convit
Code for the Convolutional Vision Transformer (ConViT)
☆474 · Oct 25, 2021 · Updated 4 years ago
xingyizhou / CenterNet2
Two-stage CenterNet
☆1,222 · Nov 20, 2022 · Updated 3 years ago
facebookresearch / ConvNeXt
Code release for ConvNeXt model
☆6,415 · Jan 8, 2023 · Updated 3 years ago
hustvl / YOLOS
[NeurIPS 2021] You Only Look at One Sequence
☆905 · May 4, 2022 · Updated 4 years ago
JDAI-CV / CoTNet
This is an official implementation for "Contextual Transformer Networks for Visual Recognition".
☆538 · Aug 8, 2021 · Updated 4 years ago
changlin31 / BossNAS
(ICCV 2021) BossNAS: Exploring Hybrid CNN-transformers with Block-wisely Self-supervised Neural Architecture Search
☆143 · Dec 6, 2021 · Updated 4 years ago
locuslab / convmixer
Implementation of ConvMixer for "Patches Are All You Need? 🤷"
☆1,082 · Nov 11, 2022 · Updated 3 years ago
facebookresearch / MaskFormer
Per-Pixel Classification is Not All You Need for Semantic Segmentation (NeurIPS 2021, spotlight)
☆1,463 · Mar 11, 2022 · Updated 4 years ago
mlpc-ucsd / CoaT
(ICCV 2021 Oral) CoaT: Co-Scale Conv-Attentional Image Transformers
☆235 · Feb 3, 2022 · Updated 4 years ago
microsoft / DynamicHead
☆653 · Nov 28, 2022 · Updated 3 years ago
huggingface / pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…
☆37,013 · Jul 16, 2026 · Updated last week
raoyongming / DynamicViT
[NeurIPS 2021] [T-PAMI] DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification
☆668 · Jul 11, 2023 · Updated 3 years ago
joe-siyuan-qiao / DetectoRS
DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous Convolution
☆1,146 · Dec 14, 2021 · Updated 4 years ago
Meituan-AutoML / Twins
Two simple and effective designs of vision transformer, which is on par with the Swin transformer
☆611 · Feb 14, 2023 · Updated 3 years ago
facebookresearch / dino
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
☆7,609 · Jul 3, 2024 · Updated 2 years ago
PeizeSun / OneNet
[ICML2021] What Makes for End-to-End Object Detection
☆641 · Apr 30, 2023 · Updated 3 years ago
microsoft / SimMIM
This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".
☆1,047 · Sep 29, 2022 · Updated 3 years ago
DingXiaoH / RepMLP
RepMLPNet: Hierarchical Vision MLP with Re-parameterized Locality (CVPR 2022)
☆307 · Feb 10, 2023 · Updated 3 years ago
openseg-group / openseg.pytorch
The official Pytorch implementation of OCNet, OCRNet, and SegFix.
☆1,234 · Jul 25, 2024 · Updated 2 years ago
SwinTransformer / Transformer-SSL
This is an official implementation for "Self-Supervised Learning with Swin Transformers".
☆671 · May 13, 2021 · Updated 5 years ago