microsoft / vision-evaluation
☆34 · Updated last year
Related projects
Alternatives and complementary repositories for vision-evaluation
- [NeurIPS 2022] code for "K-LITE: Learning Transferable Visual Models with External Knowledge" https://arxiv.org/abs/2204.09222 ☆51 · Updated last year
- PyTorch ObjectDetection Modules and ONNX ops ☆18 · Updated last year
- ☆16 · Updated 2 weeks ago
- Official repository for the General Robust Image Task (GRIT) Benchmark ☆50 · Updated last year
- A light-weight implementation of ICCV2023 paper "Reinforce Data, Multiply Impact: Improved Model Accuracy and Robustness with Dataset Rei… ☆77 · Updated last year
- Un-*** 50 billions multimodality dataset ☆24 · Updated 2 years ago
- Load any clip model with a standardized interface ☆21 · Updated 6 months ago
- Code for the paper titled "CiT: Curation in Training for Effective Vision-Language Data". ☆78 · Updated last year
- ☆24 · Updated 3 years ago
- [NeurIPS 2022] The official implementation of "Learning to Discover and Detect Objects". ☆108 · Updated last year
- understanding model mistakes with human annotations ☆105 · Updated last year
- Vision Longformer For Object Detection ☆35 · Updated 3 years ago
- An open source implementation of CLIP. ☆32 · Updated 2 years ago
- ☆32 · Updated 2 years ago
- Research publication code for "Forward Compatible Training for Large-Scale Embedding Retrieval Systems", CVPR 2022, and "FastFill: Effici… ☆53 · Updated last year
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy" ☆96 · Updated 2 months ago
- Code for T-MARS data filtering ☆35 · Updated last year
- Official PyTorch implementation of RIO ☆18 · Updated 3 years ago
- Repository for the paper Do SSL Models Have Déjà Vu? A Case of Unintended Memorization in Self-supervised Learning ☆37 · Updated last year
- [NeurIPS 2021] ORL: Unsupervised Object-Level Representation Learning from Scene Images ☆58 · Updated 2 years ago
- ☆58 · Updated 8 months ago
- State-of-the-art pretrained vision model from Bing Multimedia ☆18 · Updated last year
- Create a source of truth for ML model results and browse it on Papers with Code ☆26 · Updated 3 years ago
- ☆18 · Updated 2 years ago
- The ORBIT dataset is a collection of videos of objects in clean and cluttered scenes recorded by people who are blind/low-vision on a mob… ☆97 · Updated 3 months ago
- ImageNet-12k subset of ImageNet-21k (fall11) ☆20 · Updated last year
- ☆43 · Updated last year
- Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone ☆127 · Updated last year
- A huge dataset for Document Visual Question Answering ☆13 · Updated 3 months ago
- Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi… ☆50 · Updated 2 years ago