kdwonn / DivELinks
Repository of "Improving Cross-Modal Retrieval With Set of Diverse Embeddings" (CVPR'23, Highlight)
☆41Updated last year
Alternatives and similar repositories for DivE
Users that are interested in DivE are comparing it to the libraries listed below
Sorting:
- Official repository of the "ReSTR: Convolution-Free Referring Image Segmentation Using Transformers (CVPR'22)"☆15Updated 10 months ago
- The official code for Devil's on the Edges: Selective Quad Attention for Scene Graph Generation, CVPR2023.☆24Updated 2 years ago
- Official repository of the "Shatter and Gather: Learning Referring Image Segmentation with Text Supervision (ICCV'23)"☆39Updated last year
- ICLR‘24 Offical Implementation of Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization☆73Updated last year
- [ECCV 2024] EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval☆41Updated 6 months ago
- Official PyTorch implementation of the paper "CoVR: Learning Composed Video Retrieval from Web Video Captions".☆115Updated 3 weeks ago
- Official PyTorch Implementation of Efficient and Versatile Robust Fine-Tuning of Zero-shot Models, ECCV 2024☆17Updated last year
- [ECCV 2024] Official code for "Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation"☆18Updated 3 months ago
- Official implementation of TCL (CVPR 2023)☆118Updated 2 years ago
- [ICLR 2025] This repo is the official implementation of our paper "Learning Fine-Grained Representations through Textual Token Disentangl…☆20Updated 3 months ago
- Implementation of the paper Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval (CVPR 2024)☆20Updated 11 months ago
- ☆95Updated 3 years ago
- Official code for Zero-shot Referring Expression Comprehension via Structural Similarity Between Images and Captions (CVPR 2024)☆26Updated last year
- [ECCV'24] Official PyTorch implementation of In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation☆46Updated last year
- [CVPR 2023] Learning Attention as Disentangler for Compositional Zero-shot Learning☆39Updated 2 years ago
- Visual Delta Generator with Large Multi-modal Model for Semi-supervised Composed Image Retrieval - CVPR2024☆20Updated last year
- Cross Modal Retrieval with Querybank Normalisation☆56Updated last year
- ☆26Updated last year
- [ICLR 2023] Temporal Alignment Representations with Contrastive Learning☆27Updated 2 years ago
- Official Code for Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions☆16Updated last year
- [CVPR 2024] Do you remember? Dense Video Captioning with Cross-Modal Memory Retrieval☆61Updated last year
- ☆36Updated 2 years ago
- [ICLR 2024] Official repository for "Vision-by-Language for Training-Free Compositional Image Retrieval"☆80Updated last year
- [ICCV 2023] - Composed Image Retrieval on Common Objects in context (CIRCO) dataset☆78Updated 2 months ago
- Code and data for the paper "Emergent Visual-Semantic Hierarchies in Image-Text Representations" (ECCV 2024)☆31Updated last year
- [CVPR 2023 Highlight & TPAMI] Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning☆121Updated 10 months ago
- ☆30Updated 2 years ago
- [CVPR2023] The code for 《Position-guided Text Prompt for Vision-Language Pre-training》☆151Updated 2 years ago
- ☆88Updated 3 years ago
- [CVPR 2023 & IJCV 2025] Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation☆64Updated 3 months ago