facebookresearch / mae
PyTorch implementation of MAE https://arxiv.org/abs/2111.06377
☆7,347 Updated 3 months ago
Related projects
Alternatives and complementary repositories for mae
- Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners ☆2,602 Updated last year
- This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows". ☆13,939 Updated 3 months ago
- ☆10,491 Updated 6 months ago
- Official DeiT repository ☆4,070 Updated 8 months ago
- Code release for ConvNeXt model ☆5,779 Updated last year
- PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722 ☆4,802 Updated last month
- Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py… ☆20,700 Updated this week
- End-to-End Object Detection with Transformers ☆13,636 Updated 8 months ago
- The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --… ☆32,320 Updated this week
- Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV) ☆3,387 Updated last year
- OpenMMLab Self-Supervised Learning Toolbox and Benchmark ☆3,199 Updated last year
- OpenMMLab Pre-training Toolbox and Benchmark ☆3,460 Updated 3 weeks ago
- Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale) ☆1,942 Updated 2 years ago
- Deformable DETR: Deformable Transformers for End-to-End Object Detection. ☆3,249 Updated 6 months ago
- PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO ☆6,382 Updated 4 months ago
- OpenMMLab Semantic Segmentation Toolbox and Benchmark. ☆8,299 Updated 3 months ago
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models. ☆6,632 Updated 3 months ago
- Scenic: A Jax Library for Computer Vision Research and Beyond ☆3,334 Updated last month
- Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab. ☆4,065 Updated 4 months ago
- A deep learning library for video understanding research. ☆3,334 Updated 3 months ago
- SimCLRv2 - Big Self-Supervised Models are Strong Semi-Supervised Learners ☆4,108 Updated last year
- This is a collection of our NAS and Vision Transformer work. ☆1,690 Updated 3 months ago
- This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and … ☆1,811 Updated last year
- OpenMMLab Computer Vision Foundation ☆5,909 Updated this week
- RepVGG: Making VGG-style ConvNets Great Again ☆3,335 Updated last year
- EVA Series: Visual Representation Fantasies from BAAI ☆2,312 Updated 3 months ago
- An open source implementation of CLIP. ☆10,367 Updated last week
- PyTorch implementation of MoCo v3: https://arxiv.org/abs/2104.02057 ☆1,218 Updated 2 years ago
- Official PyTorch implementation of SegFormer ☆2,590 Updated 3 months ago
- Grounded Language-Image Pre-training ☆2,231 Updated 9 months ago