Aasthaengg / GLIP-BLIP-Vision-Langauge-Obj-Det-VQA
☆32 · Updated 2 years ago
Alternatives and similar repositories for GLIP-BLIP-Vision-Langauge-Obj-Det-VQA
Users who are interested in GLIP-BLIP-Vision-Langauge-Obj-Det-VQA are comparing it to the libraries listed below.
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model… ☆36 · Updated last year
- Official PyTorch implementation of RIO ☆18 · Updated 3 years ago
- Official repository for the General Robust Image Task (GRIT) Benchmark ☆54 · Updated 2 years ago
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy" ☆101 · Updated 8 months ago
- A task-agnostic vision-language architecture as a step towards General Purpose Vision ☆92 · Updated 3 years ago
- Detectron2 Toolbox and Benchmark for V3Det ☆17 · Updated last year
- Object Recognition as Next Token Prediction (CVPR 2024 Highlight) ☆178 · Updated last month
- A simple wrapper library for binding timm models as detectron2 backbones ☆43 · Updated 2 years ago
- [FGVC9-CVPR 2022] The second place solution for 2nd eBay eProduct Visual Search Challenge. ☆26 · Updated 2 years ago
- Code release for "Language-conditioned Detection Transformer" ☆87 · Updated 11 months ago
- [CVPR 2023 Highlight] Beyond mAP: Towards better evaluation of instance segmentation ☆26 · Updated 2 years ago
- ALIGN trained on COYO-dataset ☆29 · Updated last year
- A PyTorch implementation of Open Vocabulary Object Detection with Pseudo Bounding-Box Labels ☆60 · Updated 2 years ago
- Official PyTorch implementation of Self-emerging Token Labeling ☆33 · Updated last year
- EdgeSAM model for use with Autodistill. ☆26 · Updated 11 months ago
- Code for the AAAI 2023 paper “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models” ☆17 · Updated 2 years ago
- Vision-oriented multimodal AI ☆49 · Updated 11 months ago
- In-the-wild Question Answering ☆15 · Updated 2 years ago
- Our public repo ranked 1st 🏆🏆 at the MMSports2023 challenge on the segmentation task ☆17 · Updated last year
- [ICME 2022] Code for the paper "SimViT: Exploring a simple vision transformer with sliding windows." ☆68 · Updated 2 years ago
- Detectron2 is a platform for object detection, segmentation and other visual recognition tasks. ☆19 · Updated 3 years ago
- Code for our ICLR 2024 paper "PerceptionCLIP: Visual Classification by Inferring and Conditioning on Contexts" ☆77 · Updated last year
- Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations ☆14 · Updated 2 years ago
- (NeurIPS 2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection ☆116 · Updated last year
- A curated list of papers and resources for text-to-image evaluation. ☆29 · Updated last year
- [NeurIPS 2022] The official implementation of "Learning to Discover and Detect Objects". ☆111 · Updated last year
- A survey on video and language understanding. ☆50 · Updated 2 years ago