Official code Cross-Covariance Image Transformer (XCiT)
☆674Sep 28, 2021Updated 4 years ago
Alternatives and similar repositories for xcit
Users that are interested in xcit are comparing it to the libraries listed below
Sorting:
- Official DeiT repository☆4,325Mar 15, 2024Updated last year
- PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO☆7,459Jul 3, 2024Updated last year
- VOLO: Vision Outlooker for Visual Recognition☆949Sep 18, 2022Updated 3 years ago
- VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.☆3,295Mar 3, 2024Updated 2 years ago
- A data augmentations library for audio, image, text, and video.☆5,071Feb 13, 2026Updated 2 weeks ago
- Code for the Convolutional Vision Transformer (ConViT)☆472Oct 25, 2021Updated 4 years ago
- Code release for SLIP Self-supervision meets Language-Image Pre-training☆787Feb 9, 2023Updated 3 years ago
- "SOLQ: Segmenting Objects by Learning Queries", SOLQ is an end-to-end instance segmentation framework with Transformer.☆200Apr 17, 2022Updated 3 years ago
- Code release for ConvNeXt model☆6,300Jan 8, 2023Updated 3 years ago
- Two simple and effective designs of vision transformer, which is on par with the Swin transformer☆608Feb 14, 2023Updated 3 years ago
- ☆246Jul 23, 2021Updated 4 years ago
- Codebase for Image Classification Research, written in PyTorch.☆2,168Mar 20, 2024Updated last year
- Pytorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers"☆433Sep 5, 2023Updated 2 years ago
- A deep learning library for video understanding research.☆3,544Jan 12, 2026Updated last month
- Masked Siamese Networks for Label-Efficient Learning (https://arxiv.org/abs/2204.07141)☆463May 9, 2022Updated 3 years ago
- [ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"☆291Apr 25, 2022Updated 3 years ago
- Official Pytorch Implementation of: "ImageNet-21K Pretraining for the Masses"(NeurIPS, 2021) paper☆780Jan 11, 2023Updated 3 years ago
- ☆73Jun 3, 2022Updated 3 years ago
- Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations. [NeurIPS 2021]☆89Oct 2, 2021Updated 4 years ago
- ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet☆1,192Oct 27, 2023Updated 2 years ago
- Official repository for the "Big Transfer (BiT): General Visual Representation Learning" paper.☆1,539Jul 30, 2024Updated last year
- This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".☆1,024Sep 29, 2022Updated 3 years ago
- PyTorch implementation of SwAV https//arxiv.org/abs/2006.09882☆2,089Apr 13, 2023Updated 2 years ago
- [ICCV 2023] You Only Look at One Partial Sequence☆343Oct 21, 2023Updated 2 years ago
- EsViT: Efficient self-supervised Vision Transformers☆411Aug 28, 2023Updated 2 years ago
- ☆249Mar 16, 2022Updated 3 years ago
- Code to reproduce the results in the FAIR research papers "Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting V…☆492Apr 28, 2023Updated 2 years ago
- Official implementation of PVT series☆1,887Oct 27, 2022Updated 3 years ago
- Self-supervised vIsion Transformer (SiT)☆337Dec 24, 2022Updated 3 years ago
- Official Pytorch implementation of ReXNet (Rank eXpansion Network) with pretrained models☆452Jan 30, 2022Updated 4 years ago
- Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".☆1,999Mar 21, 2024Updated last year
- ☆110Sep 15, 2021Updated 4 years ago
- CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped, CVPR 2022☆589Nov 1, 2023Updated 2 years ago
- Implementation of DropLoss for Long-Tail Instance Segmentation in Pytorch☆42Apr 14, 2021Updated 4 years ago
- This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".☆15,716Jul 24, 2024Updated last year
- PyTorch implementation of Barlow Twins.☆1,002Mar 3, 2022Updated 4 years ago
- Per-Pixel Classification is Not All You Need for Semantic Segmentation (NeurIPS 2021, spotlight)☆1,450Mar 11, 2022Updated 3 years ago
- Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm☆675Sep 19, 2022Updated 3 years ago
- Un-Mix: Rethinking Image Mixtures for Unsupervised Visual Representation Learning.☆151Aug 10, 2022Updated 3 years ago