facebookresearch / active_indexing
Official implementation of "Active Image Indexing"
☆59 · Updated 2 years ago
Alternatives and similar repositories for active_indexing:
Users who are interested in active_indexing are comparing it to the repositories listed below:
- Understanding model mistakes with human annotations ☆106 · Updated 2 years ago
- Code release for "Improved baselines for vision-language pre-training" ☆60 · Updated 10 months ago
- [NeurIPS 2022] Official PyTorch implementation of Optimizing Relevance Maps of Vision Transformers Improves Robustness. This code allows … ☆127 · Updated 2 years ago
- Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143) ☆160 · Updated last year
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy" ☆101 · Updated 6 months ago
- [NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training" ☆312 · Updated 9 months ago
- Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training ☆136 · Updated 2 years ago
- Code and models for the paper "The effectiveness of MAE pre-pretraining for billion-scale pretraining" https://arxiv.org/abs/2303.13496 ☆88 · Updated 8 months ago
- Official repository of the paper "GPR1200: A Benchmark for General-Purpose Content-Based Image Retrieval" ☆28 · Updated 7 months ago
- [ECCV2024][ICCV2023] Official PyTorch implementation of SeiT++ and SeiT ☆55 · Updated 7 months ago
- A PyTorch implementation of Mugs proposed by our paper "Mugs: A Multi-Granular Self-Supervised Learning Framework". ☆84 · Updated last year
- Official repository for "Revisiting Weakly Supervised Pre-Training of Visual Perception Models". https://arxiv.org/abs/2201.08371 ☆179 · Updated 2 years ago
- JAX implementation of ViT-VQGAN ☆82 · Updated 2 years ago
- Timm model explorer ☆37 · Updated 11 months ago
- Code release for "Dropout Reduces Underfitting" ☆312 · Updated last year
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in PyTorch ☆100 · Updated last year
- ViT trained on COYO-Labeled-300M dataset ☆32 · Updated 2 years ago
- An unopinionated replacement for PyTorch's Dataset and ImageFolder that handles Tar archives ☆76 · Updated 2 years ago
- 1st Place Solution in Google Universal Image Embedding ☆62 · Updated last year
- Code for the paper titled "CiT: Curation in Training for Effective Vision-Language Data". ☆78 · Updated 2 years ago
- [BMVC22] Official Implementation of ViCHA: "Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment" ☆54 · Updated 2 years ago
- PyTorch code for MUST ☆106 · Updated 2 years ago
- CLIP Itself is a Strong Fine-tuner: Achieving 85.7% and 88.0% Top-1 Accuracy with ViT-B and ViT-L on ImageNet ☆213 · Updated 2 years ago
- Memory-efficient CUDA kernels for training ConvNets with PyTorch. ☆39 · Updated last month
- Efficiently read embeddings in a streaming fashion from any filesystem ☆99 · Updated 11 months ago
- Code and Models for "GeneCIS: A Benchmark for General Conditional Image Similarity" ☆56 · Updated last year
- Code for the Video Similarity Challenge. ☆77 · Updated last year