dvirsamuel / PDMLinks
Code for our paper: "Where's Waldo: Diffusion Features For Personalized Segmentation and Retrieval".
☆13Updated 3 months ago
Alternatives and similar repositories for PDM
Users that are interested in PDM are comparing it to the libraries listed below
Sorting:
- [CVPR'25] 🌟🌟 EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering☆33Updated last month
- ICLR‘24 Offical Implementation of Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization☆72Updated last year
- [CVPR2025] Official implementation of RAM☆16Updated 2 months ago
- ☆11Updated 6 months ago
- [ICLR 2025] This repo is the official implementation of our paper "Learning Fine-Grained Representations through Textual Token Disentangl…☆11Updated last month
- AMES: Asymmetric and Memory-Efficient Similarity☆33Updated 7 months ago
- [ECCV 2024 Oral] Code for our paper "A Fair Ranking and New Model for Panoptic Scene Graph Generation"☆14Updated last week
- [NeurIPS 2024] Official PyTorch implementation of LoTLIP: Improving Language-Image Pre-training for Long Text Understanding☆43Updated 4 months ago
- CVPR2025: Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning☆32Updated 2 months ago
- ☆55Updated 8 months ago
- [NeurIPS 2024] Activating Self-Attention for Multi-Scene Absolute Pose Regression☆11Updated 3 months ago
- [ECCV 2024] Official repository for "DataDream: Few-shot Guided Dataset Generation"☆39Updated 10 months ago
- Visual Delta Generator with Large Multi-modal Model for Semi-supervised Composed Image Retrieval - CVPR2024☆18Updated last year
- This is the project for 'USG'.☆16Updated 2 months ago
- [CVPR25] CoLLM: A Large Language Model for Composed Image Retrieval☆20Updated 2 months ago
- [ECCV'24] Official PyTorch implementation of In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation☆41Updated 8 months ago
- 🎨Official Repo for Every Painting Awakened: A Training-free Framework for Painting-to-Animation Generation☆54Updated last month
- [NeurIPS 2024] Understanding Multi-Granularity for Open-Vocabulary Part Segmentation☆49Updated 5 months ago
- Repository of "Improving Cross-Modal Retrieval With Set of Diverse Embeddings" (CVPR'23, Highlight)☆40Updated last year
- Official Implementation of "Towards Open-Vocabulary Semantic Segmentation without Semantic Labels" (NeurIPS 2024)☆44Updated 8 months ago
- [ECCV 2024] OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models☆47Updated 4 months ago
- [ICCV'2023 Oral] Implicit Temporal Modeling with Learnable Alignment for Video Recognition☆35Updated last year
- Official repository of the paper "Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentati…☆33Updated 4 months ago
- code for FineLIP☆23Updated 2 months ago
- Code for paper 'Leveraging Predicate and Triplet Learning for Scene Graph Generation'. (CVPR 2024)☆30Updated 3 weeks ago
- The official repository for ECCV2024 paper "PromptCCD: Learning Gaussian Mixture Prompt Pool for Continual Category Discovery"☆22Updated 2 months ago
- ☆22Updated last year
- Official PyTorch implementation of the paper "CoVR: Learning Composed Video Retrieval from Web Video Captions".☆106Updated last month
- ☆10Updated 3 months ago
- Pytorch Implementation of LLaVA-ReID: Selective Multi-image Questioner for Interactive Person Re-Identification☆23Updated last month