rwightman / imagenet-12k
ImageNet-12k subset of ImageNet-21k (fall11)
☆21Updated 2 years ago
Alternatives and similar repositories for imagenet-12k
Users who are interested in imagenet-12k are comparing it to the repositories listed below.
- ☆18Updated 2 years ago
- [ECCV2024][ICCV2023] Official PyTorch implementation of SeiT++ and SeiT☆55Updated last year
- Un-*** 50 billions multimodality dataset☆22Updated 3 years ago
- ☆19Updated 2 years ago
- Learning Features with Parameter-Free Layers, ICLR 2022☆84Updated 2 years ago
- Implementation of Kronecker Attention in Pytorch☆19Updated 5 years ago
- Implementation of the Remixer Block from the Remixer paper, in Pytorch☆36Updated 4 years ago
- An open source implementation of CLIP.☆33Updated 2 years ago
- This is a PyTorch implementation of the paper "ViP: A Differentially Private Foundation Model for Computer Vision"☆36Updated 2 years ago
- Stochastic Optimization for Global Contrastive Learning without Large Mini-batches☆20Updated 2 years ago
- This is an official PyTorch/GPU implementation of SupMAE.☆78Updated 3 years ago
- Repository for the PopulAtion Parameter Averaging (PAPA) paper☆27Updated last year
- Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi…☆51Updated 3 years ago
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012☆49Updated 3 years ago
- [ICLR 2022] "As-ViT: Auto-scaling Vision Transformers without Training" by Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wa…☆76Updated 3 years ago
- Patching open-vocabulary models by interpolating weights☆91Updated 2 years ago
- Directed masked autoencoders☆14Updated 2 years ago
- understanding model mistakes with human annotations☆106Updated 2 years ago
- [CVPR 2021] "The Lottery Tickets Hypothesis for Supervised and Self-supervised Pre-training in Computer Vision Models" Tianlong Chen, Jon…☆68Updated 2 years ago
- ViT trained on COYO-Labeled-300M dataset☆33Updated 2 years ago
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch☆102Updated 2 years ago
- Implementation of Cross Transformer for spatially-aware few-shot transfer, in Pytorch☆54Updated 4 years ago
- (ICML 2022) Official PyTorch implementation of “Blurs Behave Like Ensembles: Spatial Smoothings to Improve Accuracy, Uncertainty, and Rob…☆78Updated 3 years ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".☆18Updated 4 years ago
- codebase for the SIMAT dataset and evaluation☆38Updated 3 years ago
- ☆41Updated 2 years ago
- ☆37Updated 2 years ago
- LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021)☆18Updated 2 years ago
- Original code base for On Pretraining Data Diversity for Self-Supervised Learning☆14Updated 9 months ago
- [NeurIPS 2022] Official PyTorch implementation of Optimizing Relevance Maps of Vision Transformers Improves Robustness. This code allows …☆133Updated 2 years ago