Official code Cross-Covariance Image Transformer (XCiT)
☆674Sep 28, 2021Updated 4 years ago
Alternatives and similar repositories for xcit
Users that are interested in xcit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official DeiT repository☆4,327Mar 15, 2024Updated 2 years ago
- PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO☆7,485Jul 3, 2024Updated last year
- A data augmentations library for audio, image, text, and video.☆5,070Mar 14, 2026Updated last week
- VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.☆3,294Mar 3, 2024Updated 2 years ago
- VOLO: Vision Outlooker for Visual Recognition☆950Sep 18, 2022Updated 3 years ago
- Code for the Convolutional Vision Transformer (ConViT)☆472Oct 25, 2021Updated 4 years ago
- Code release for SLIP Self-supervision meets Language-Image Pre-training☆787Feb 9, 2023Updated 3 years ago
- Code release for ConvNeXt model☆6,319Jan 8, 2023Updated 3 years ago
- Two simple and effective designs of vision transformer, which is on par with the Swin transformer☆608Feb 14, 2023Updated 3 years ago
- "SOLQ: Segmenting Objects by Learning Queries", SOLQ is an end-to-end instance segmentation framework with Transformer.☆200Apr 17, 2022Updated 3 years ago
- ☆73Jun 3, 2022Updated 3 years ago
- ☆246Jul 23, 2021Updated 4 years ago
- Masked Siamese Networks for Label-Efficient Learning (https://arxiv.org/abs/2204.07141)☆464May 9, 2022Updated 3 years ago
- Implementation of DropLoss for Long-Tail Instance Segmentation in Pytorch☆42Apr 14, 2021Updated 4 years ago
- Codebase for Image Classification Research, written in PyTorch.☆2,166Mar 20, 2024Updated 2 years ago
- Pytorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers"☆433Sep 5, 2023Updated 2 years ago
- ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet☆1,194Oct 27, 2023Updated 2 years ago
- Code to reproduce the results in the FAIR research papers "Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting V…☆492Apr 28, 2023Updated 2 years ago
- Official Pytorch Implementation of: "ImageNet-21K Pretraining for the Masses"(NeurIPS, 2021) paper☆780Jan 11, 2023Updated 3 years ago
- A deep learning library for video understanding research.☆3,550Jan 12, 2026Updated 2 months ago
- Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations. [NeurIPS 2021]☆89Oct 2, 2021Updated 4 years ago
- This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".☆1,029Sep 29, 2022Updated 3 years ago
- [ICCV 2023] You Only Look at One Partial Sequence☆343Oct 21, 2023Updated 2 years ago
- Official repository for the "Big Transfer (BiT): General Visual Representation Learning" paper.☆1,538Jul 30, 2024Updated last year
- Per-Pixel Classification is Not All You Need for Semantic Segmentation (NeurIPS 2021, spotlight)☆1,450Mar 11, 2022Updated 4 years ago
- Self-supervised vIsion Transformer (SiT)☆337Dec 24, 2022Updated 3 years ago
- This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".☆15,782Jul 24, 2024Updated last year
- [ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"☆292Apr 25, 2022Updated 3 years ago
- PyTorch implementation of SwAV https//arxiv.org/abs/2006.09882☆2,090Apr 13, 2023Updated 2 years ago
- EsViT: Efficient self-supervised Vision Transformers☆412Aug 28, 2023Updated 2 years ago
- CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped, CVPR 2022☆588Nov 1, 2023Updated 2 years ago
- ☆249Mar 16, 2022Updated 4 years ago
- Official implementation of PVT series☆1,888Oct 27, 2022Updated 3 years ago
- Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".☆1,996Mar 21, 2024Updated 2 years ago
- PyTorch implementation of Barlow Twins.☆1,002Mar 3, 2022Updated 4 years ago
- Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm☆677Sep 19, 2022Updated 3 years ago
- An end-to-end PyTorch framework for image and video classification☆1,613Jun 27, 2024Updated last year
- Generative Adversarial Transformers☆1,346Jun 14, 2022Updated 3 years ago
- The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…☆36,538Updated this week