kakaobrain / coyo-vit
ViT trained on COYO-Labeled-300M dataset
☆32Updated 2 years ago
Alternatives and similar repositories for coyo-vit:
Users that are interested in coyo-vit are comparing it to the libraries listed below
- ALIGN trained on COYO-dataset☆29Updated 10 months ago
- ImageNet-12k subset of ImageNet-21k (fall11)☆21Updated last year
- [ECCV2024][ICCV2023] Official PyTorch implementation of SeiT++ and SeiT☆55Updated 7 months ago
- An open source implementation of CLIP.☆32Updated 2 years ago
- Un-*** 50 billions multimodality dataset☆24Updated 2 years ago
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"☆101Updated 6 months ago
- A PyTorch Dataset that caches samples in shared memory, accessible globally to all processes☆21Updated 2 years ago
- ☆17Updated last year
- ☆17Updated 2 years ago
- State-of-the-art data augmentation search algorithms in PyTorch☆47Updated last year
- A dashboard for exploring timm learning rate schedulers☆19Updated 3 months ago
- This is a offical PyTorch/GPU implementation of SupMAE.☆77Updated 2 years ago
- Implementing DropPath/StochasticDepth in PyTorch☆16Updated 3 years ago
- This is an official implementation of GRIT-VLP☆21Updated 2 years ago
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch☆100Updated last year
- Code release for the CVPR'23 paper titled "PartDistillation Learning part from Instance Segmentation"☆58Updated last year
- ☆50Updated last year
- [FGVC9-CVPR 2022] The second place solution for 2nd eBay eProduct Visual Search Challenge.☆26Updated 2 years ago
- clip retrieval benchmark☆17Updated 2 years ago
- [ECCV 2022] This repository includes the official implementation our paper "In Defense of Image Pre-Training for Spatiotemporal Recogniti…☆19Updated 2 years ago
- This is a PyTorch implementation of the paperViP A Differentially Private Foundation Model for Computer Vision☆36Updated last year
- ☆21Updated last year
- 4th place solution for the Google Universal Image Embedding Kaggle Challenge. Instance-Level Recognition workshop at ECCV 2022☆42Updated last year
- HIRL: A General Framework for Hierarchical Image Representation Learning (http://arxiv.org/abs/2205.13159)☆40Updated 2 years ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".☆18Updated 3 years ago
- ☆37Updated last year
- TensorFlow implementation of "TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?"☆33Updated 3 years ago
- Learning Features with Parameter-Free Layers, ICLR 2022☆85Updated last year
- ☆15Updated 2 years ago
- Official implementation of "Active Image Indexing"☆59Updated 2 years ago