Sense-X/UniFormer

Sense-X / UniFormer

[ICLR2022] official implementation of UniFormer

☆906

Alternatives and similar repositories for UniFormer

Users who are interested in UniFormer are comparing it to the libraries listed below.

OpenGVLab / UniFormerV2
[ICCV2023] UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer
☆350 · Apr 2, 2024 · Updated 2 years ago
sail-sg / poolformer
PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)
☆1,363 · Jun 1, 2024 · Updated 2 years ago
MCG-NJU / VideoMAE
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
☆1,775 · Dec 8, 2023 · Updated 2 years ago
SwinTransformer / Video-Swin-Transformer
This is an official implementation for "Video Swin Transformers".
☆1,667 · Mar 8, 2023 · Updated 3 years ago
facebookresearch / ConvNeXt
Code release for ConvNeXt model
☆6,414 · Jan 8, 2023 · Updated 3 years ago
facebookresearch / omnivore
Omnivore: A Single Model for Many Visual Modalities
☆573 · Nov 12, 2022 · Updated 3 years ago
ShoufaChen / CycleMLP
[ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"
☆290 · Apr 25, 2022 · Updated 4 years ago
facebookresearch / TimeSformer
The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"
☆1,863 · Apr 9, 2024 · Updated 2 years ago
facebookresearch / SlowFast
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
☆7,392 · Mar 16, 2026 · Updated 4 months ago
hustvl / MIMDet
[ICCV 2023] You Only Look at One Partial Sequence
☆343 · Oct 21, 2023 · Updated 2 years ago
fudan-zvg / SOFT
[NeurIPS 2021 Spotlight] & [IJCV 2024] SOFT: Softmax-free Transformer with Linear Complexity
☆310 · Mar 16, 2024 · Updated 2 years ago
Meituan-AutoML / Twins
Two simple and effective designs of vision transformer, which is on par with the Swin transformer
☆611 · Feb 14, 2023 · Updated 3 years ago
whai362 / PVT
Official implementation of PVT series
☆1,902 · Oct 27, 2022 · Updated 3 years ago
SHI-Labs / Neighborhood-Attention-Transformer
Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022
☆1,182 · May 15, 2024 · Updated 2 years ago
facebookresearch / mae
PyTorch implementation of MAE https://arxiv.org/abs/2111.06377
☆8,366 · Jul 23, 2024 · Updated last year
xxxnell / how-do-vits-work
(ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"
☆822 · Jul 14, 2022 · Updated 4 years ago
KimManjin / RSA
Official Pytorch Implementation of Relational Self-Attention, NeurIPS 2021
☆49 · Dec 7, 2021 · Updated 4 years ago
OpenGVLab / efficient-video-recognition
☆184 · Aug 20, 2022 · Updated 3 years ago
microsoft / Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
☆16,004 · Jul 24, 2024 · Updated last year
lucidrains / uniformer-pytorch
Implementation of Uniformer, a simple attention and 3d convolutional net that achieved SOTA in a number of video classification tasks, de…
☆102 · Apr 22, 2022 · Updated 4 years ago
czczup / ViT-Adapter
[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions
☆1,503 · Jun 3, 2025 · Updated last year
facebookresearch / Motionformer
Code + pre-trained models for the paper "Keeping Your Eye on the Ball: Trajectory Attention in Video Transformers"
☆234 · Jun 13, 2022 · Updated 4 years ago
MCG-NJU / TDN
[CVPR 2021] TDN: Temporal Difference Networks for Efficient Action Recognition
☆384 · Sep 17, 2022 · Updated 3 years ago
Alpha-VL / ConvMAE
ConvMAE: Masked Convolution Meets Masked Autoencoders
☆530 · Mar 14, 2023 · Updated 3 years ago
locuslab / convmixer
Implementation of ConvMixer for "Patches Are All You Need? 🤷"
☆1,081 · Nov 11, 2022 · Updated 3 years ago
microsoft / SimMIM
This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".
☆1,047 · Sep 29, 2022 · Updated 3 years ago
facebookresearch / deit
Official DeiT repository
☆4,359 · Mar 15, 2024 · Updated 2 years ago
dingmyu / davit
[ECCV 2022] Code for paper "DaViT: Dual Attention Vision Transformer"
☆378 · Feb 13, 2024 · Updated 2 years ago
sail-sg / volo
VOLO: Vision Outlooker for Visual Recognition
☆948 · Sep 18, 2022 · Updated 3 years ago
VITA-Group / AsViT
[ICLR 2022] "As-ViT: Auto-scaling Vision Transformers without Training" by Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wa…
☆76 · Feb 21, 2022 · Updated 4 years ago
facebookresearch / MaskFormer
Per-Pixel Classification is Not All You Need for Semantic Segmentation (NeurIPS 2021, spotlight)
☆1,462 · Mar 11, 2022 · Updated 4 years ago
wofmanaf / ResT
This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".
☆291 · Sep 28, 2022 · Updated 3 years ago
facebookresearch / xcit
Official code Cross-Covariance Image Transformer (XCiT)
☆681 · Sep 28, 2021 · Updated 4 years ago
ZwwWayne / K-Net
[NeurIPS2021] Code Release of K-Net: Towards Unified Image Segmentation
☆484 · Dec 16, 2021 · Updated 4 years ago
snap-research / EfficientFormer
EfficientFormerV2 [ICCV 2023] & EfficientFormer [NeurIPS 2022]
☆1,115 · Aug 13, 2023 · Updated 2 years ago
mit-han-lab / temporal-shift-module
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
☆2,215 · Jul 11, 2024 · Updated 2 years ago
facebookresearch / SLIP
Code release for SLIP Self-supervision meets Language-Image Pre-training
☆792 · Feb 9, 2023 · Updated 3 years ago
fundamentalvision / Deformable-DETR
Deformable DETR: Deformable Transformers for End-to-End Object Detection.
☆4,001 · May 16, 2024 · Updated 2 years ago
facebookresearch / msn
Masked Siamese Networks for Label-Efficient Learning (https://arxiv.org/abs/2204.07141)
☆463 · May 9, 2022 · Updated 4 years ago