microsoft / vision-evaluation
☆35Updated last year
Alternatives and similar repositories for vision-evaluation:
Users that are interested in vision-evaluation are comparing it to the libraries listed below
- An open source implementation of CLIP.☆32Updated 2 years ago
- ☆16Updated 2 months ago
- ☆24Updated 3 years ago
- [NeurIPS 2022] code for "K-LITE: Learning Transferable Visual Models with External Knowledge" https://arxiv.org/abs/2204.09222☆51Updated last year
- State-of-the-art pretrained vision model from Bing Multimedia☆18Updated last year
- Load any clip model with a standardized interface☆21Updated 8 months ago
- PyTorch ObjectDetection Modules and ONNX ops☆18Updated last year
- ImageNet-12k subset of ImageNet-21k (fall11)☆21Updated last year
- Research publication code for "Forward Compatible Training for Large-Scale Embedding Retrieval Systems", CVPR 2022, and "FastFill: Effici…☆54Updated last year
- ☆18Updated 2 years ago
- understanding model mistakes with human annotations☆106Updated last year
- Repository for the paper Do SSL Models Have Déjà Vu? A Case of Unintended Memorization in Self-supervised Learning☆37Updated last year
- ☆11Updated 2 years ago
- A light-weight implementation of ICCV2023 paper "Reinforce Data, Multiply Impact: Improved Model Accuracy and Robustness with Dataset Rei…☆79Updated last year
- ViT trained on COYO-Labeled-300M dataset☆30Updated 2 years ago
- Implementation of Multistream Transformers in Pytorch☆53Updated 3 years ago
- Vision Longformer For Object Detection☆35Updated 3 years ago
- clip retrieval benchmark☆17Updated 2 years ago
- Implementation of Kronecker Attention in Pytorch☆18Updated 4 years ago
- A self-supervised learning approach based on extremely large masking☆29Updated 2 years ago
- Un-*** 50 billions multimodality dataset☆24Updated 2 years ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated this week
- The ORBIT dataset is a collection of videos of objects in clean and cluttered scenes recorded by people who are blind/low-vision on a mob…☆98Updated 5 months ago
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆78Updated 2 years ago
- Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi…☆50Updated 2 years ago
- ☆58Updated 10 months ago
- ☆25Updated 3 years ago
- A huge dataset for Document Visual Question Answering☆15Updated 5 months ago
- Implementation of the Remixer Block from the Remixer paper, in Pytorch☆35Updated 3 years ago