purnasai / CLIP_Image_Retrieval
Image/Instance Retrieval using CLIP, A self supervised Learning Model
☆24Updated last year
Alternatives and similar repositories for CLIP_Image_Retrieval:
Users that are interested in CLIP_Image_Retrieval are comparing it to the libraries listed below
- ☆30Updated last month
- Edge Weight Prediction For Category-Agnostic Pose Estimation☆36Updated last month
- code for paper: Simultaneous Image to Zero and Zero to Noise: Diffusion Models with Analytical Image Attenuation☆43Updated 8 months ago
- ☆57Updated last year
- Official PyTorch implementation of Self-Supervised Any-Point Tracking by Contrastive Random Walks, ECCV 2024.☆46Updated 2 months ago
- This repository is for the first survey on SAM for videos.☆30Updated this week
- ☆86Updated 2 weeks ago
- An interactive demo based on Segment-Anything for stroke-based painting which enables human-like painting.☆34Updated last year
- [PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension☆24Updated last year
- LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding☆15Updated 2 weeks ago
- ☆28Updated last year
- ☆62Updated last year
- [ICCV2023] EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understanding☆75Updated last year
- Boosting vision transformers for image retrieval, proposed design of Deep Token Pooling(DToP)☆37Updated 2 years ago
- [WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memory☆52Updated last week
- ☆19Updated last year
- Open-Vocabulary Panoptic Segmentation☆21Updated 4 months ago
- Code For Our Work: DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries [ECCV-2024]☆14Updated 6 months ago
- ☆25Updated 3 months ago
- This repo contains extensions to DINO V2 model by Meta, and awesome applications built on top of it.☆38Updated last year
- ☆57Updated last year
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆41Updated 5 months ago
- [ECCV 2024] - ScanTalk: 3D Talking Heads from Unregistered Scans☆31Updated 3 months ago
- Personalized Representation from Personalized Generation☆49Updated last month
- Code release for the CVPR'23 paper titled "PartDistillation Learning part from Instance Segmentation"☆59Updated last year
- [CVPR24] Official Implementation of GEM (Grounding Everything Module)☆107Updated 3 months ago
- Training code for CLIP-FlanT5☆22Updated 6 months ago
- ☆26Updated last year
- [NeurIPS 2023] HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception☆40Updated 10 months ago