kakaobrain / coyo-vit
ViT trained on COYO-Labeled-300M dataset
☆32Updated 2 years ago
Alternatives and similar repositories for coyo-vit
Users who are interested in coyo-vit are comparing it to the libraries listed below.
- State-of-the-art data augmentation search algorithms in PyTorch☆47Updated 2 years ago
- ImageNet-12k subset of ImageNet-21k (fall11)☆21Updated 2 years ago
- An open source implementation of CLIP.☆33Updated 2 years ago
- Official implementation of "Active Image Indexing"☆59Updated 2 years ago
- ALIGN trained on COYO-dataset☆29Updated last year
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch☆103Updated last year
- A PyTorch Dataset that caches samples in shared memory, accessible globally to all processes☆22Updated 3 years ago
- Code for paper Rethinking the Data Annotation Process for Multi-view 3D Pose Estimation with Active Learning and Self-Training☆22Updated 2 years ago
- [ECCV2024][ICCV2023] Official PyTorch implementation of SeiT++ and SeiT☆55Updated last year
- A non-JIT version implementation / replication of CLIP of OpenAI in pytorch☆34Updated 4 years ago
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"☆101Updated last year
- clip retrieval benchmark☆17Updated 3 years ago
- Learning Features with Parameter-Free Layers, ICLR 2022☆84Updated 2 years ago
- This is an official PyTorch/GPU implementation of SupMAE.☆78Updated 3 years ago
- ☆19Updated 2 years ago
- Implementing DropPath/StochasticDepth in PyTorch☆16Updated 3 years ago
- Load any clip model with a standardized interface☆22Updated last week
- Official repository of the paper "GPR1200: A Benchmark for General-Purpose Content-Based Image Retrieval"☆29Updated 5 months ago
- Official Pytorch implementation of the paper: "Locally Shifted Attention With Early Global Integration"☆15Updated 3 years ago
- 🐝 Explore Trending Papers at CVPR☆55Updated 4 years ago
- understanding model mistakes with human annotations☆106Updated 2 years ago
- This is a PyTorch implementation of the paper "ViP: A Differentially Private Foundation Model for Computer Vision"☆36Updated 2 years ago
- ☆17Updated 2 years ago
- [ICME 2022] code for the paper, SimVit: Exploring a simple vision transformer with sliding windows.☆68Updated 2 years ago
- Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi…☆51Updated 3 years ago
- ☆52Updated last year
- Un-*** 50 billions multimodality dataset☆23Updated 3 years ago
- Code release for the CVPR'23 paper titled "PartDistillation: Learning Parts from Instance Segmentation"☆57Updated last year
- [ICLR 2022] "As-ViT: Auto-scaling Vision Transformers without Training" by Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wa…☆76Updated 3 years ago
- HIRL: A General Framework for Hierarchical Image Representation Learning (http://arxiv.org/abs/2205.13159)☆40Updated 3 years ago