facebookresearch / SWAG
Official repository for "Revisiting Weakly Supervised Pre-Training of Visual Perception Models". https://arxiv.org/abs/2201.08371.
☆178 · Updated 2 years ago
Alternatives and similar repositories for SWAG:
Users interested in SWAG are comparing it to the libraries listed below.
- A PyTorch implementation of Mugs proposed by our paper "Mugs: A Multi-Granular Self-Supervised Learning Framework". ☆83 · Updated last year
- PyTorch code for MUST ☆106 · Updated last year
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities ☆78 · Updated 2 years ago
- Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training ☆134 · Updated last year
- Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143) ☆159 · Updated last year
- Open-source code for Generic Grouping Network (GGN, CVPR 2022) ☆109 · Updated 2 months ago
- Code release for the research paper "Exploring Long-Sequence Masked Autoencoders" ☆99 · Updated 2 years ago
- Unofficial implementation of Pix2SEQ ☆165 · Updated 3 years ago
- CLIP Itself is a Strong Fine-tuner: Achieving 85.7% and 88.0% Top-1 Accuracy with ViT-B and ViT-L on ImageNet ☆211 · Updated 2 years ago
- Implementation of Uniformer, a simple attention and 3d convolutional net that achieved SOTA in a number of video classification tasks, de… ☆98 · Updated 2 years ago
- BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training ☆396 · Updated 3 months ago
- EsViT: Efficient self-supervised Vision Transformers ☆411 · Updated last year
- Understanding model mistakes with human annotations ☆106 · Updated last year
- [NeurIPS 2021] ORL: Unsupervised Object-Level Representation Learning from Scene Images ☆58 · Updated 3 years ago
- Official PyTorch implementation of A-ViT: Adaptive Tokens for Efficient Vision Transformer (CVPR 2022) ☆152 · Updated 2 years ago
- A compilation of network architectures for vision and other tasks that do not use the self-attention mechanism ☆77 · Updated 2 years ago
- Official PyTorch implementation of the paper "Solving ImageNet: a Unified Scheme for Training any Backbone to Top Results" (2022) ☆191 · Updated 2 years ago
- [ECCV 2022] New benchmark for evaluating pre-trained models; new supervised contrastive learning framework. ☆107 · Updated last year
- [CVPR 2023] Learning Visual Representations via Language-Guided Sampling ☆146 · Updated last year
- [NeurIPS 2022] Official PyTorch implementation of Optimizing Relevance Maps of Vision Transformers Improves Robustness. This code allows … ☆127 · Updated 2 years ago
- Code for the paper "CiT: Curation in Training for Effective Vision-Language Data" ☆78 · Updated 2 years ago
- A task-agnostic vision-language architecture as a step towards General Purpose Vision ☆92 · Updated 3 years ago
- Code and models for the paper "The effectiveness of MAE pre-pretraining for billion-scale pretraining" (https://arxiv.org/abs/2303.13496) ☆86 · Updated 6 months ago
- PyTorch Implementation of Region Similarity Representation Learning (ReSim) ☆89 · Updated 3 years ago
- Bamboo: 4 times larger than ImageNet; 2 times larger than Object365; built by active learning. ☆173 · Updated 10 months ago
- [ICCV 2023] You Only Look at One Partial Sequence ☆340 · Updated last year
- TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers (ECCV 2022) ☆93 · Updated 2 years ago
- [ICLR 2022] "As-ViT: Auto-scaling Vision Transformers without Training" by Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wa… ☆76 · Updated 3 years ago
- An official PyTorch/GPU implementation of SupMAE ☆77 · Updated 2 years ago