SforAiDl / vformer
A modular PyTorch library for vision transformer models
☆162Updated last year
Alternatives and similar repositories for vformer:
Users that are interested in vformer are comparing it to the libraries listed below
- Pytorch implementation of LOST unsupervised object discovery method☆244Updated last year
- Probing the representations of Vision Transformers.☆324Updated 2 years ago
- Implementation of Uniformer, a simple attention and 3d convolutional net that achieved SOTA in a number of video classification tasks, de…☆99Updated 3 years ago
- VICRegL official code base☆226Updated 2 years ago
- EsViT: Efficient self-supervised Vision Transformers☆410Updated last year
- Implementation of STAM (Space Time Attention Model), a pure and simple attention model that reaches SOTA for video classification☆131Updated 4 years ago
- [NeurIPS 2022] Official PyTorch implementation of Optimizing Relevance Maps of Vision Transformers Improves Robustness. This code allows …☆127Updated 2 years ago
- Masked Siamese Networks for Label-Efficient Learning (https://arxiv.org/abs/2204.07141)☆456Updated 2 years ago
- Contains code for the paper "Vision Transformers are Robust Learners" (AAAI 2022).☆126Updated 2 years ago
- Code to reproduce the results in the FAIR research papers "Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting V…☆488Updated 2 years ago
- Implementation of popular SOTA self-supervised learning algorithms as Fastai Callbacks.☆320Updated 2 years ago
- [ICLR 2023] This repository includes the official implementation our paper "Can CNNs Be More Robust Than Transformers?"☆143Updated 2 years ago
- Easiest way of fine-tuning HuggingFace video classification models☆141Updated 2 years ago
- [NeurIPS 2021] Official codes for "Efficient Training of Visual Transformers with Small Datasets".☆141Updated 4 months ago
- [ICML 2023] Official PyTorch implementation of Global Context Vision Transformers☆432Updated last year
- The repository collects many various multi-modal transformer architectures, including image transformer, video transformer, image-languag…☆226Updated 2 years ago
- Official repository for "Revisiting Weakly Supervised Pre-Training of Visual Perception Models". https://arxiv.org/abs/2201.08371.☆179Updated 3 years ago
- Unofficial PyTorch implementation of TokenLearner by Google AI☆65Updated 2 years ago
- TF2 implementation of knowledge distillation using the "function matching" hypothesis from https://arxiv.org/abs/2106.05237.☆87Updated 3 years ago
- ☆184Updated last year
- Implementation of Online Label Smoothing in PyTorch☆94Updated 2 years ago
- (ICCV 2021 Oral) CoaT: Co-Scale Conv-Attentional Image Transformers☆230Updated 3 years ago
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆78Updated 3 years ago
- [CVPR 2022] Deep Spectral Methods: A Surprisingly Strong Baseline for Unsupervised Semantic Segmentation and Localization☆235Updated 2 years ago
- understanding model mistakes with human annotations☆106Updated 2 years ago
- source code for ICLR'22 paper "VOS: Learning What You Don’t Know by Virtual Outlier Synthesis"☆313Updated last year
- Repository providing a wide range of self-supervised pretrained models for computer vision tasks.☆61Updated 4 years ago
- The PASS dataset: pretrained models and how to get the data☆265Updated 2 years ago
- Code release for "Dropout Reduces Underfitting"☆313Updated 2 years ago
- Code Release for MeMViT Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition, CVPR 2022☆148Updated 2 years ago