purnasai / CLIP_Image_Retrieval
Image/Instance Retrieval using CLIP, a self-supervised learning model
☆29 Updated 2 years ago
Alternatives and similar repositories for CLIP_Image_Retrieval
Users who are interested in CLIP_Image_Retrieval are comparing it to the libraries listed below
- Code release for the CVPR'23 paper "PartDistillation: Learning Parts from Instance Segmentation" ☆58 Updated last year
- [ICCV2023] EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understanding ☆76 Updated last year
- Code for the IterInpaint model, presented in Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation (CVPR 2024 work… ☆25 Updated last year
- Baby-DALL3: Annotate anything in visual tasks and generate anything, all in one pipeline with GPT-4 (a small baby of DALL·E 3). ☆83 Updated last year
- ☆93 Updated last month
- [ICLR'23] GOOD: Exploring Geometric Cues for Detecting Objects in an Open World ☆38 Updated 2 years ago
- Code for recreating the HoS benchmark of VISOR ☆22 Updated 2 years ago
- Odd-One-Out: Anomaly Detection by Comparing with Neighbors (CVPR25) ☆47 Updated 8 months ago
- DAGM GCPR 2023 Paper: HiFiHR: Enhancing 3D Hand Reconstruction from a Single Image via High-Fidelity Texture ☆26 Updated last year
- [CVPR 2025] Parallel Sequence Modeling via Generalized Spatial Propagation Network ☆106 Updated last month
- Open-Vocabulary Panoptic Segmentation ☆26 Updated 2 months ago
- Official code for "Opening up Open World Tracking" (CVPR 2022) ☆56 Updated 2 years ago
- A practice for million-scale multi-domain universal object detection ☆28 Updated last year
- Official PyTorch implementation of Self-Supervised Any-Point Tracking by Contrastive Random Walks, ECCV 2024. ☆52 Updated 9 months ago
- Code for "TAG: Guidance-free Open-Vocabulary Semantic Segmentation" ☆15 Updated last year
- ☆31 Updated 7 months ago
- ☆20 Updated 2 years ago
- Vision-oriented multimodal AI ☆49 Updated last year
- Generative model for 3D objects. ☆17 Updated 2 years ago
- [CVPR'2025] EntitySAM: Segment Everything in Video ☆41 Updated last month
- Official Implementation for "Matching Is Not Enough: A Two-Stage Framework for Category-Agnostic Pose Estimation", CVPR 2023. ☆52 Updated 2 years ago
- Open-vocabulary Semantic Segmentation ☆33 Updated last year
- Scaling Properties of Diffusion Models For Perceptual Tasks (CVPR 2025) ☆42 Updated 3 months ago
- ☆39 Updated last year
- This repository is for the paper "Is BERT Blind? Exploring the Effect of Vision-and-Language Pretraining on Visual Language Understanding… ☆20 Updated last year
- [NeurIPS2022] This is the official implementation of the paper "Expediting Large-Scale Vision Transformer for Dense Prediction without Fi… ☆85 Updated last year
- This repository is for the first survey on SAM & SAM2 for Videos. ☆52 Updated 4 months ago
- ☆14 Updated 2 years ago
- 1-shot image segmentation using Stable Diffusion ☆141 Updated last year
- Code for the ICML 2023 paper "Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation" ☆37 Updated last year