SforAiDl / vformer
A modular PyTorch library for vision transformer models
☆161Updated last year
Alternatives and similar repositories for vformer:
Users that are interested in vformer are comparing it to the libraries listed below
- Pytorch implementation of LOST unsupervised object discovery method☆239Updated last year
- Probing the representations of Vision Transformers.☆319Updated 2 years ago
- Implementation of STAM (Space Time Attention Model), a pure and simple attention model that reaches SOTA for video classification☆130Updated 3 years ago
- Masked Siamese Networks for Label-Efficient Learning (https://arxiv.org/abs/2204.07141)☆454Updated 2 years ago
- Easiest way of fine-tuning HuggingFace video classification models☆136Updated last year
- [CVPR 2022] Deep Spectral Methods: A Surprisingly Strong Baseline for Unsupervised Semantic Segmentation and Localization☆233Updated last year
- EsViT: Efficient self-supervised Vision Transformers☆409Updated last year
- VICRegL official code base☆224Updated last year
- A summarization of Transformer-based architectures for CV tasks, including image classification, object detection, segmentation, and Few-…☆107Updated 2 years ago
- Code to reproduce the results in the FAIR research papers "Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting V…☆487Updated last year
- Implementation of Uniformer, a simple attention and 3d convolutional net that achieved SOTA in a number of video classification tasks, de…☆98Updated 2 years ago
- Implementation of popular SOTA self-supervised learning algorithms as Fastai Callbacks.☆318Updated last year
- ☆181Updated last year
- [NeurIPS 2022] Official PyTorch implementation of Optimizing Relevance Maps of Vision Transformers Improves Robustness. This code allows …☆127Updated 2 years ago
- Official implementation of the CVPR 2022 paper "DETReg: Unsupervised Pretraining with Region Priors for Object Detection".☆334Updated last year
- [NeurIPS 2021] Official codes for "Efficient Training of Visual Transformers with Small Datasets".☆140Updated 2 weeks ago
- Fine-tune Facebook's DETR (DEtection TRansformer) on Colaboratory.☆142Updated last year
- [ICML 2023] Official PyTorch implementation of Global Context Vision Transformers☆426Updated last year
- Self-Supervised Learning in PyTorch☆132Updated 10 months ago
- [ECCV'22] Official repository of paper titled "Class-agnostic Object Detection with Multi-modal Transformer".☆303Updated last year
- source code for ICLR'22 paper "VOS: Learning What You Don’t Know by Virtual Outlier Synthesis"☆309Updated last year
- This repository contains an overview of important follow-up works based on the original Vision Transformer (ViT) by Google.☆160Updated 3 years ago
- MultiMAE: Multi-modal Multi-task Masked Autoencoders, ECCV 2022☆560Updated 2 years ago
- understanding model mistakes with human annotations☆106Updated last year
- Code Release for MeMViT Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition, CVPR 2022☆148Updated 2 years ago
- A toolbox for receptive field analysis and visualizing neural network architectures☆113Updated 3 weeks ago
- FFCV-SSL Fast Forward Computer Vision for Self-Supervised Learning.☆202Updated last year
- (ICCV 2021 Oral) CoaT: Co-Scale Conv-Attentional Image Transformers☆231Updated 2 years ago
- LightCollections⚡️: Ready-to-use implementations such as `LightningModules` for various computer vision papers.☆22Updated 2 years ago
- Repository providing a wide range of self-supervised pretrained models for computer vision tasks.☆62Updated 3 years ago