dvirsamuel / PDM
Code for our paper: "Where's Waldo: Diffusion Features For Personalized Segmentation and Retrieval".
☆12Updated 2 months ago
Alternatives and similar repositories for PDM:
Users who are interested in PDM are comparing it to the repositories listed below:
- Visual Delta Generator with Large Multi-modal Model for Semi-supervised Composed Image Retrieval - CVPR2024☆18Updated 11 months ago
- Repository of "Improving Cross-Modal Retrieval With Set of Diverse Embeddings" (CVPR'23, Highlight)☆40Updated last year
- ICLR'24 Official Implementation of Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization☆72Updated last year
- [NeurIPS 2024] Official PyTorch implementation of LoTLIP: Improving Language-Image Pre-training for Long Text Understanding☆43Updated 3 months ago
- Official PyTorch implementation of the paper "CoVR: Learning Composed Video Retrieval from Web Video Captions".☆106Updated last month
- [ECCV 2024] Official repository for "DataDream: Few-shot Guided Dataset Generation"☆37Updated 9 months ago
- [ECCV 2024] EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval☆37Updated 3 weeks ago
- Official Code for Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions☆15Updated last year
- Official Implementation of "Towards Open-Vocabulary Semantic Segmentation without Semantic Labels" (NeurIPS 2024)☆42Updated 7 months ago
- [CVPR2025] Official implementation of RAM☆14Updated last month
- Implementation of the paper Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval (CVPR 2024)☆18Updated 6 months ago
- [ECCV 2024] OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models☆45Updated 4 months ago
- cliptrase☆36Updated 8 months ago
- The official repository for ECCV2024 paper "PromptCCD: Learning Gaussian Mixture Prompt Pool for Continual Category Discovery"☆22Updated last month
- [ECCV'24] Official PyTorch implementation of In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation☆40Updated 7 months ago
- [MM2024 Oral] 3D-GRES: Generalized 3D Referring Expression Segmentation☆36Updated 4 months ago
- [ICLR 2025] - Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion☆37Updated 2 weeks ago
- [ICLR 2024] Official repository for "Vision-by-Language for Training-Free Compositional Image Retrieval"☆65Updated 10 months ago
- Official repository of the "Shatter and Gather: Learning Referring Image Segmentation with Text Supervision (ICCV'23)"☆37Updated last year
- CVPR2025: Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning☆29Updated last month
- [NeurIPS 2024] Understanding Multi-Granularity for Open-Vocabulary Part Segmentation☆47Updated 4 months ago
- [ECCV 2024 Oral] Code for our paper "A Fair Ranking and New Model for Panoptic Scene Graph Generation"☆14Updated 2 months ago
- FreeDA: Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation (CVPR 2024)☆41Updated 8 months ago
- Composed Video Retrieval☆56Updated last year
- Code and data for the paper "Emergent Visual-Semantic Hierarchies in Image-Text Representations" (ECCV 2024)☆28Updated 8 months ago
- Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"☆13Updated last month