Aasthaengg / GLIP-BLIP-Vision-Langauge-Obj-Det-VQA
☆33Updated 2 years ago
Alternatives and similar repositories for GLIP-BLIP-Vision-Langauge-Obj-Det-VQA
Users that are interested in GLIP-BLIP-Vision-Langauge-Obj-Det-VQA are comparing it to the libraries listed below
Sorting:
- A simple wrapper library for binding timm models as detectron2 backbones☆42Updated last year
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆35Updated last year
- ☆88Updated last year
- Official repository of the paper "GPR1200: A Benchmark for General-PurposeContent-Based Image Retrieval"☆28Updated last month
- ☆18Updated 2 years ago
- Low-latency ONNX and TensorRT based zero-shot classification and detection with contrastive language-image pre-training based prompts☆38Updated 8 months ago
- [CVPR 2023 Highlight] Beyond mAP: Towards better evaluation of instance segmentation☆26Updated 2 years ago
- [ICME 2022] code for the paper, SimVit: Exploring a simple vision transformer with sliding windows.☆68Updated 2 years ago
- Official PyTorch implementation of RIO☆18Updated 3 years ago
- EdgeSAM model for use with Autodistill.☆26Updated 11 months ago
- A task-agnostic vision-language architecture as a step towards General Purpose Vision☆92Updated 3 years ago
- Official Training and Inference Code of Amodal Expander, Proposed in Tracking Any Object Amodally☆18Updated 10 months ago
- Tracking through Containers and Occluders in the Wild (CVPR 2023) - Official Implementation☆41Updated 11 months ago
- HIRL: A General Framework for Hierarchical Image Representation Learning (http://arxiv.org/abs/2205.13159)☆40Updated 2 years ago
- [IJCV 2024] TransDETR: End-to-end Video Text Spotting with Transformer☆103Updated last year
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"☆101Updated 8 months ago
- [FGVC9-CVPR 2022] The second place solution for 2nd eBay eProduct Visual Search Challenge.☆26Updated 2 years ago
- Code release for the CVPR'23 paper titled "PartDistillation Learning part from Instance Segmentation"☆58Updated last year
- Detection Transformers with Assignment☆253Updated last year
- 4th place solution for the Google Universal Image Embedding Kaggle Challenge. Instance-Level Recognition workshop at ECCV 2022☆42Updated last year
- Official Pytorch Implementation of Self-emerging Token Labeling☆33Updated last year
- Official repository for the General Robust Image Task (GRIT) Benchmark☆54Updated 2 years ago
- [ICLR 2023] PyTorch implementation of VLDet (https://arxiv.org/abs/2211.14843)☆186Updated last year
- [CVPR 2023] implementation of Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information.☆91Updated last year
- Object Recognition as Next Token Prediction (CVPR 2024 Highlight)☆177Updated 2 weeks ago
- [NeurIPS 2023] HASSOD: Hierarchical Adaptive Self-Supervised Object Detection☆56Updated last year
- Run zero-shot prediction models on your data☆32Updated 4 months ago
- Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training☆137Updated 2 years ago
- Auto Segmentation label generation with SAM (Segment Anything) + Grounding DINO☆19Updated 3 months ago
- ☆64Updated last year