purnasai / CLIP_Image_Retrieval
Image/Instance Retrieval using CLIP, A self supervised Learning Model
☆28Updated last year
Alternatives and similar repositories for CLIP_Image_Retrieval:
Users that are interested in CLIP_Image_Retrieval are comparing it to the libraries listed below
- Odd-One-Out: Anomaly Detection by Comparing with Neighbors (CVPR25)☆33Updated 4 months ago
- ☆27Updated 3 months ago
- Open-vocabulary Semantic Segmentation☆34Updated last year
- Open-Vocabulary Panoptic Segmentation☆23Updated 7 months ago
- Code release for the CVPR'23 paper titled "PartDistillation Learning part from Instance Segmentation"☆58Updated last year
- ☆34Updated last year
- This repository is for the first survey on SAM & SAM2 for Videos.☆40Updated last week
- ☆28Updated last year
- OVAD: Open-vocabulary Attribute Detection code☆29Updated last year
- YOLO-World + EfficientViT SAM☆92Updated last year
- [ICCV2023] EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understanding☆75Updated last year
- Baby-DALL3: Annotation anything in visual tasks and Generate anything just all in one-pipeline with GPT-4 (a small baby of DALL·E 3).☆83Updated last year
- ☆59Updated last year
- Boosting vision transformers for image retrieval, proposed design of Deep Token Pooling(DToP)☆37Updated 2 years ago
- Visualization of the PCA as shown in Figure 1.☆22Updated last year
- ☆64Updated last year
- Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.☆18Updated 2 years ago
- [CBMI2024 Best Paper] Official repository of the paper "Is CLIP the main roadblock for fine-grained open-world perception?".☆23Updated 3 months ago
- Official implementation of the WACV 2024 paper CLIP-DIY☆34Updated last year
- ☆22Updated last year
- official implementation of Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation☆12Updated last year
- [PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension☆26Updated last year
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆42Updated 8 months ago
- ☆88Updated 3 months ago
- ☆23Updated 6 months ago
- Code for IterInpaint model, presented in Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation (CVPR 2024 work…☆25Updated 8 months ago
- Code for the paper "Benchmarking Object Detectors with COCO: A New Path Forward."☆26Updated 9 months ago
- [ICLR'23] GOOD: Exploring Geometric Cues for Detecting Objects in an Open World☆40Updated last year
- [CVPR24] Official Implementation of GEM (Grounding Everything Module)☆114Updated last week
- code for "Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation"☆14Updated 2 years ago