lucidrains / uniformer-pytorch
Implementation of Uniformer, a simple attention and 3D convolutional net that achieved SOTA on a number of video classification tasks, presented at ICLR 2022
☆97 Updated 2 years ago
Related projects
Alternatives and complementary repositories for uniformer-pytorch
- Implementation of STAM (Space Time Attention Model), a pure and simple attention model that reaches SOTA for video classification ☆130 Updated 3 years ago
- A PyTorch implementation of Mugs proposed by our paper "Mugs: A Multi-Granular Self-Supervised Learning Framework". ☆83 Updated 9 months ago
- This is an official PyTorch/GPU implementation of SupMAE. ☆77 Updated 2 years ago
- ☆241 Updated 2 years ago
- [ICME 2022] code for the paper, SimVit: Exploring a simple vision transformer with sliding windows. ☆67 Updated 2 years ago
- Official implementation of the paper Vision Transformer with Progressive Sampling, ICCV 2021. ☆150 Updated 2 years ago
- ☆69 Updated last year
- Video Contrastive Learning with Global Context, ICCVW 2021 ☆158 Updated 2 years ago
- A compilation of network architectures for vision and others without usage of self-attention mechanism ☆77 Updated last year
- MLP-Like Vision Permutator for Visual Recognition (PyTorch) ☆190 Updated 2 years ago
- PyTorch Implementation of Region Similarity Representation Learning (ReSim) ☆87 Updated 3 years ago
- super image for action recognition ☆55 Updated 2 years ago
- NeurIPS 2021, Official codes for "Efficient Training of Visual Transformers with Small Datasets". ☆139 Updated last year
- Unofficial PyTorch implementation of TokenLearner by Google AI ☆64 Updated last year
- Implementation of Long-Short Transformer, combining local and global inductive biases for attention over long sequences, in Pytorch ☆116 Updated 3 years ago
- Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations. [NeurIPS 2021] ☆88 Updated 3 years ago
- PyTorch implementation of Asymmetric Siamese (https://arxiv.org/abs/2204.00613) ☆100 Updated 2 years ago
- ECCV 2022, Bootstrapped Masked Autoencoders for Vision BERT Pretraining ☆98 Updated 2 years ago
- PyTorch code for MUST ☆105 Updated last year
- Code Release for MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition, CVPR 2022 ☆145 Updated last year
- Implementation of OmniNet, Omnidirectional Representations from Transformers, in Pytorch ☆56 Updated 3 years ago
- Official repository for "Revisiting Weakly Supervised Pre-Training of Visual Perception Models". https://arxiv.org/abs/2201.08371. ☆173 Updated 2 years ago
- Official codes for ConMIM (ICLR 2023) ☆57 Updated last year
- LoMaR (Efficient Self-supervised Vision Pretraining with Local Masked Reconstruction) ☆61 Updated 2 years ago
- ☆112 Updated 2 years ago
- [NeurIPS 2021] ORL: Unsupervised Object-Level Representation Learning from Scene Images ☆58 Updated 2 years ago
- ☆132 Updated last year
- [ECCV 2022] Is Appearance Free Action Recognition Possible? ☆58 Updated 8 months ago
- ☆189 Updated last year
- [CVPR2022 - Oral] Official Jax Implementation of Learned Queries for Efficient Local Attention ☆114 Updated 2 years ago