facebookresearch / data2vec_vision

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

☆78

Alternatives and similar repositories for data2vec_vision

Users who are interested in data2vec_vision are comparing it to the repositories listed below.

hila-chefer / RobustViT
[NeurIPS 2022] Official PyTorch implementation of Optimizing Relevance Maps of Vision Transformers Improves Robustness. This code allows …
☆131 Updated 2 years ago
IntelLabs / VL-InterpreT
Visual Language Transformer Interpreter - An interactive visualization tool for interpreting vision-language transformers
☆94 Updated last year
salesforce / MUST
PyTorch code for MUST
☆107 Updated 3 months ago
redcaps-dataset / redcaps-downloader
Command-line tool for downloading and extending the RedCaps dataset.
☆48 Updated last year
facebookresearch / diht
Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training
☆138 Updated 2 years ago
mlfoundations / patching
Patching open-vocabulary models by interpolating weights
☆91 Updated last year
zinengtang / Perceiver_VL
PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023)
☆33 Updated 2 years ago
facebookresearch / SWAG
Official repository for "Revisiting Weakly Supervised Pre-Training of Visual Perception Models". https://arxiv.org/abs/2201.08371.
☆179 Updated 3 years ago
enyac-group / supmae
This is an official PyTorch/GPU implementation of SupMAE.
☆78 Updated 2 years ago
facebookresearch / imagenetx
understanding model mistakes with human annotations
☆106 Updated 2 years ago
alextamkin / dabs
A Domain-Agnostic Benchmark for Self-Supervised Learning
☆107 Updated 2 years ago
naver-ai / seit
[ECCV2024][ICCV2023] Official PyTorch implementation of SeiT++ and SeiT
☆55 Updated 11 months ago
Weixin-Liang / MetaShift
MetaShift: A Dataset of Datasets for Evaluating Contextual Distribution Shifts and Training Conflicts (ICLR 2022)
☆109 Updated 2 years ago
goel-shashank / CyCLIP
☆120 Updated 2 years ago
facebookresearch / CiT
Code for the paper titled "CiT: Curation in Training for Effective Vision-Language Data".
☆78 Updated 2 years ago
allenai / gpv2
☆32 Updated 3 years ago
joaanna / disentangling_spelling_in_clip
☆34 Updated 2 years ago
sail-sg / mugs
A PyTorch implementation of Mugs proposed by our paper "Mugs: A Multi-Granular Self-Supervised Learning Framework".
☆83 Updated last year
VITA-Group / AsViT
[ICLR 2022] "As-ViT: Auto-scaling Vision Transformers without Training" by Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wa…
☆76 Updated 3 years ago
kakaobrain / noc
☆46 Updated last year
allenai / gpv-1
A task-agnostic vision-language architecture as a step towards General Purpose Vision
☆92 Updated 4 years ago
LAION-AI / Big-Interleaved-Dataset
Big-Interleaved-Dataset
☆58 Updated 2 years ago
facebookresearch / OTTER
This code provides a PyTorch implementation for OTTER (Optimal Transport distillation for Efficient zero-shot Recognition), as described …
☆69 Updated 3 years ago
lucidrains / multistream-transformers
Implementation of Multistream Transformers in Pytorch
☆54 Updated 4 years ago
TomerRonen34 / mixed-resolution-vit
☆51 Updated last year
mlfoundations / imagenet-captions
Release of ImageNet-Captions
☆50 Updated 2 years ago
facebookresearch / SIMAT
codebase for the SIMAT dataset and evaluation
☆38 Updated 3 years ago
NVlabs / PALAVRA
☆52 Updated 3 years ago
lucidrains / long-short-transformer
Implementation of Long-Short Transformer, combining local and global inductive biases for attention over long sequences, in Pytorch
☆119 Updated 4 years ago
songweige / Contrastive-Learning-with-Non-Semantic-Negatives
Robust Contrastive Learning Using Negative Samples with Diminished Semantics (NeurIPS 2021)
☆39 Updated 3 years ago