purnasai / CLIP_Image_Retrieval
Image/instance retrieval using CLIP, a self-supervised learning model
☆22 Updated last year
Related projects
Alternatives and complementary repositories for CLIP_Image_Retrieval
- ☆56 Updated last year
- ☆29 Updated last month
- ☆33 Updated 9 months ago
- ☆30 Updated last year
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model ☆40 Updated 3 months ago
- Code release for the CVPR'23 paper titled "PartDistillation Learning part from Instance Segmentation" ☆59 Updated 10 months ago
- [ICCV 2023] Official implementation of the paper "Neural Interactive Keypoint Detection" ☆73 Updated last year
- A practice for million-scale multi-domain universal object detection ☆22 Updated 4 months ago
- ☆12 Updated last year
- Official Implementation for "Matching Is Not Enough: A Two-Stage Framework for Category-Agnostic Pose Estimation", CVPR 2023. ☆48 Updated last year
- Official implementation of High Fidelity Scene Text Synthesis. ☆36 Updated 2 months ago
- ECCV2024_Parrot Captions Teach CLIP to Spot Text ☆60 Updated 2 months ago
- ZIM: Zero-Shot Image Matting for Anything ☆52 Updated this week
- [ICCV2023] EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understanding ☆75 Updated last year
- ☆29 Updated last year
- Official PyTorch implementation of "No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding" ☆30 Updated 5 months ago
- Implementation of MDP: A Generalized Framework for Text-Guided Image Editing by Manipulating the Diffusion Path ☆66 Updated last year
- An interactive demo based on Segment-Anything for stroke-based painting which enables human-like painting. ☆34 Updated last year
- [PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension ☆22 Updated 10 months ago
- ☆43 Updated 6 months ago
- [CVPR 2024 Highlight] Official GraCo: Granularity-Controllable Interactive Segmentation. ☆43 Updated 3 months ago
- [ICLR 24] MaGIC: Multi-modality Guided Image Completion ☆46 Updated 6 months ago
- ☆63 Updated 11 months ago
- [ECCV 2024] - ScanTalk: 3D Talking Heads from Unregistered Scans ☆21 Updated 2 weeks ago
- [ECCV 2024 Oral] PetFace: A Large-Scale Dataset and Benchmark for Animal Identification https://arxiv.org/abs/2407.13555 ☆32 Updated 2 weeks ago
- ☆23 Updated last week
- Official PyTorch implementation for TCSVT 23 "Detect Any Shadow: Segment Anything for Video Shadow Detection" ☆52 Updated 7 months ago
- Precision Search through Multi-Style Inputs ☆51 Updated 3 months ago
- ☆19 Updated last year
- This repository is for the first survey on SAM for videos. ☆17 Updated this week