purnasai / CLIP_Image_Retrieval
Image/Instance Retrieval using CLIP, A self supervised Learning Model
☆27Updated last year
Alternatives and similar repositories for CLIP_Image_Retrieval:
Users that are interested in CLIP_Image_Retrieval are comparing it to the libraries listed below
- ☆24Updated 2 months ago
- Open-vocabulary Semantic Segmentation☆34Updated last year
- ☆31Updated 3 months ago
- [IEEE TCSVT] Official Pytorch Implementation of CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation.☆38Updated 2 months ago
- Codebase for the Recognize Anything Model (RAM)☆75Updated last year
- Baby-DALL3: Annotation anything in visual tasks and Generate anything just all in one-pipeline with GPT-4 (a small baby of DALL·E 3).☆82Updated last year
- This repository is for the first survey on SAM for videos.☆35Updated last week
- ☆33Updated last year
- Open-Vocabulary Panoptic Segmentation☆23Updated 6 months ago
- ☆64Updated last year
- [ICCV2023] EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understanding☆75Updated last year
- CAPE using text-graphs☆19Updated last month
- ☆27Updated last year
- ☆58Updated last year
- Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.☆18Updated 2 years ago
- ☆32Updated last year
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆37Updated last year
- [ECCV 2024 Oral] PetFace: A Large-Scale Dataset and Benchmark for Animal Identification https://arxiv.org/abs/2407.13555☆51Updated 5 months ago
- ☆40Updated 2 months ago
- ☆18Updated last year
- Code release for the CVPR'23 paper titled "PartDistillation Learning part from Instance Segmentation"☆58Updated last year
- ☆19Updated 9 months ago
- [ACM MM 2024] Training-free Cross-domain Image Composition via Adaptive Latent Manipulation and Energy-guided Optimization☆13Updated 3 months ago
- ☆41Updated last year
- Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model☆49Updated 2 months ago
- The Official PyTorch Implementation of OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic Segmentation☆30Updated 8 months ago
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"☆33Updated 9 months ago
- LAVIS - A One-stop Library for Language-Vision Intelligence☆47Updated 7 months ago
- code for paper: Simultaneous Image to Zero and Zero to Noise: Diffusion Models with Analytical Image Attenuation☆47Updated 3 weeks ago