rish-16 / tokenlearner-pytorch
Unofficial PyTorch implementation of TokenLearner by Google AI
☆64Updated last year
Related projects ⓘ
Alternatives and complementary repositories for tokenlearner-pytorch
- Implementation of Uniformer, a simple attention and 3d convolutional net that achieved SOTA in a number of video classification tasks, de…☆97Updated 2 years ago
- This is a offical PyTorch/GPU implementation of SupMAE.☆77Updated 2 years ago
- code release of research paper "Exploring Long-Sequence Masked Autoencoders"☆99Updated 2 years ago
- A PyTorch implementation of Mugs proposed by our paper "Mugs: A Multi-Granular Self-Supervised Learning Framework".☆83Updated 9 months ago
- ECCV2022,Bootstrapped Masked Autoencoders for Vision BERT Pretraining☆98Updated 2 years ago
- Code for the paper titled "CiT Curation in Training for Effective Vision-Language Data".☆78Updated last year
- A compilation of network architectures for vision and others without usage of self-attention mechanism☆77Updated last year
- Official codes for ConMIM (ICLR 2023)☆57Updated last year
- [ICME 2022] code for the paper, SimVit: Exploring a simple vision transformer with sliding windows.☆67Updated 2 years ago
- Code and models for the paper "The effectiveness of MAE pre-pretraining for billion-scale pretraining" https://arxiv.org/abs/2303.13496☆81Updated 4 months ago
- TensorFlow implementation of "TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?"☆33Updated 2 years ago
- [ICLR 2023 Spotlight] GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation☆97Updated last year
- [NeurIPS 2021] ORL: Unsupervised Object-Level Representation Learning from Scene Images☆58Updated 2 years ago
- Official PyTorch implementation of A-ViT: Adaptive Tokens for Efficient Vision Transformer (CVPR 2022)☆149Updated 2 years ago
- Official repository for the General Robust Image Task (GRIT) Benchmark☆50Updated last year
- ☆24Updated 3 years ago
- [ICLR2024] Exploring Target Representations for Masked Autoencoders☆51Updated 10 months ago
- Official repository for "Intriguing Properties of Vision Transformers" (NeurIPS 2021--Spotlight)☆176Updated 2 years ago
- ☆72Updated 2 years ago
- ☆63Updated 2 years ago
- TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers (ECCV 2022)☆93Updated 2 years ago
- PyTorch implementation of R-MAE https//arxiv.org/abs/2306.05411☆109Updated last year
- MixMIM: Mixed and Masked Image Modeling for Efficient Visual Representation Learning☆129Updated last year
- Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations. [NeurIPS 2021]☆88Updated 3 years ago
- [CVPR'23] Hard Patches Mining for Masked Image Modeling☆88Updated 11 months ago
- ☆65Updated last year
- ☆49Updated last year
- Official implementation of the paper Vision Transformer with Progressive Sampling, ICCV 2021.☆150Updated 2 years ago
- A simple minimal implementation of Reversible Vision Transformers☆117Updated 8 months ago
- Code Release for MeMViT Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition, CVPR 2022☆145Updated last year