kdwonn / DivE
Repository of "Improving Cross-Modal Retrieval With Set of Diverse Embeddings" (CVPR'23, Highlight)
☆36Updated last year
Alternatives and similar repositories for DivE:
Users that are interested in DivE are comparing it to the libraries listed below
- Official repository of the "ReSTR: Convolution-Free Referring Image Segmentation Using Transformers (CVPR'22)"☆12Updated last month
- The official code for Devil's on the Edges: Selective Quad Attention for Scene Graph Generation, CVPR2023.☆22Updated last year
- Official repository of the "Shatter and Gather: Learning Referring Image Segmentation with Text Supervision (ICCV'23)"☆35Updated 11 months ago
- [ECCV 2024] Official code for "Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation"☆17Updated 3 months ago
- [CVPR 2023] Learning Attention as Disentangler for Compositional Zero-shot Learning☆40Updated last year
- ☆15Updated 4 months ago
- Activity Grammars for Temporal Action Segmentation (NeurIPS 2023)☆12Updated 7 months ago
- Official implementation of TCL (CVPR 2023)☆110Updated last year
- Official PyTorch Implementation of Efficient and Versatile Robust Fine-Tuning of Zero-shot Models, ECCV 2024☆13Updated 3 months ago
- ☆35Updated last year
- ☆12Updated last month
- ICLR‘24 Offical Implementation of Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization☆68Updated 11 months ago
- [ECCV 2024] EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval☆33Updated 4 months ago
- [ECCV'24] Official PyTorch implementation of In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation☆38Updated 3 months ago
- Code and data for the paper "Emergent Visual-Semantic Hierarchies in Image-Text Representations" (ECCV 2024)☆25Updated 5 months ago
- The official code for Relational Context Learning for Human-Object Interaction Detection, CVPR2023.☆48Updated last year
- Implementation of the paper Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval (CVPR 2024)☆14Updated 2 months ago
- ☆47Updated 2 years ago
- [CVPR'24] Official PyTorch implementation of Contrastive Mean-Shift Learning for Generalized Category Discovery☆42Updated 8 months ago
- What is Where by Looking: Weakly-Supervised Open-World Phrase-Grounding without Text Inputs☆23Updated 2 years ago
- Visual Delta Generator with Large Multi-modal Model for Semi-supervised Composed Image Retrieval - CVPR2024☆18Updated 7 months ago
- ☆22Updated last year
- [ICCV'2023 Oral] Implicit Temporal Modeling with Learnable Alignment for Video Recognition☆32Updated last year
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning☆39Updated last year
- Official repository of the "Active Learning for Semantic Segmentation with Multi-class Label Query (NeurIPS'23)"☆12Updated last year
- ☆23Updated 4 months ago
- ☆19Updated last year
- Official Implementation for paper "Referring Transformer: A One-step Approach to Multi-task Visual Grounding" Neurips 2021☆66Updated 2 years ago
- [CVPR 2024] Do you remember? Dense Video Captioning with Cross-Modal Memory Retrieval☆47Updated 6 months ago