purnasai / CLIP_Image_RetrievalLinks
Image/Instance Retrieval using CLIP, A self supervised Learning Model
☆28Updated 2 years ago
Alternatives and similar repositories for CLIP_Image_Retrieval
Users that are interested in CLIP_Image_Retrieval are comparing it to the libraries listed below
Sorting:
- Odd-One-Out: Anomaly Detection by Comparing with Neighbors (CVPR25)☆42Updated 6 months ago
- ☆34Updated last year
- Open-vocabulary Semantic Segmentation☆33Updated last year
- Baby-DALL3: Annotation anything in visual tasks and Generate anything just all in one-pipeline with GPT-4 (a small baby of DALL·E 3).☆83Updated last year
- official implementation of Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation☆12Updated last year
- [CVPR25] CoLLM: A Large Language Model for Composed Image Retrieval☆25Updated 3 months ago
- ☆34Updated 2 years ago
- ☆20Updated last year
- ☆31Updated last year
- CAPE using text-graphs☆22Updated 2 months ago
- Open-Vocabulary Panoptic Segmentation☆24Updated last week
- ☆91Updated last month
- Official Implementation for "Matching Is Not Enough: A Two-Stage Framework for Category-Agnostic Pose Estimation", CVPR 2023.☆52Updated last year
- LAVIS - A One-stop Library for Language-Vision Intelligence☆48Updated 10 months ago
- Code release for the CVPR'23 paper titled "PartDistillation Learning part from Instance Segmentation"☆58Updated last year
- ☆19Updated last year
- Official PyTorch implementation of Self-Supervised Any-Point Tracking by Contrastive Random Walks, ECCV 2024.☆52Updated 7 months ago
- [ICCV2023] EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understanding☆76Updated last year
- Code for IterInpaint model, presented in Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation (CVPR 2024 work…☆25Updated 11 months ago
- Edge Weight Prediction For Category-Agnostic Pose Estimation☆41Updated last month
- ☆29Updated 5 months ago
- OVAD: Open-vocabulary Attribute Detection code☆30Updated last year
- ☆17Updated last year
- [PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension☆26Updated last year
- A visual LLM for image region description or QA.☆16Updated last year
- [IEEE TCSVT] Official Pytorch Implementation of CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation.☆43Updated 5 months ago
- This repository is for the first survey on SAM & SAM2 for Videos.☆51Updated last month
- AMES: Asymmetric and Memory-Efficient Similarity☆35Updated 8 months ago
- ☆69Updated last year
- Precision Search through Multi-Style Inputs☆70Updated 2 months ago