kdwonn / DivE
Repository of "Improving Cross-Modal Retrieval With Set of Diverse Embeddings" (CVPR'23, Highlight)
☆39 Updated last year
Alternatives and similar repositories for DivE:
Users interested in DivE are comparing it to the repositories listed below.
- Official repository of the "ReSTR: Convolution-Free Referring Image Segmentation Using Transformers (CVPR'22)" ☆12 Updated 3 months ago
- Official repository of the "Shatter and Gather: Learning Referring Image Segmentation with Text Supervision (ICCV'23)" ☆36 Updated last year
- The official code for Devil's on the Edges: Selective Quad Attention for Scene Graph Generation, CVPR2023. ☆22 Updated last year
- [ECCV 2024] EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval ☆35 Updated 7 months ago
- [ECCV 2024] Official code for "Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation" ☆18 Updated 6 months ago
- [CVPR 2023] Learning Attention as Disentangler for Compositional Zero-shot Learning ☆39 Updated last year
- Activity Grammars for Temporal Action Segmentation (NeurIPS 2023) ☆12 Updated 9 months ago
- Official PyTorch Implementation of Efficient and Versatile Robust Fine-Tuning of Zero-shot Models, ECCV 2024 ☆13 Updated 5 months ago
- Implementation of the paper Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval (CVPR 2024) ☆17 Updated 4 months ago
- [ICLR 2023] Temporal Alignment Representations with Contrastive Learning ☆26 Updated last year
- Official implementation of TCL (CVPR 2023) ☆109 Updated last year
- Official PyTorch Implementation of HIER: Metric Learning Beyond Class Labels via Hierarchical Regularization, CVPR 2023 ☆57 Updated last year
- Visual Delta Generator with Large Multi-modal Model for Semi-supervised Composed Image Retrieval - CVPR2024 ☆18 Updated 9 months ago
- [ECCV'24] Official PyTorch implementation of In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation ☆39 Updated 6 months ago
- ☆34 Updated last year
- ICLR'24 Official Implementation of Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization ☆72 Updated last year
- ☆16 Updated 3 weeks ago
- The official code for Relational Context Learning for Human-Object Interaction Detection, CVPR2023. ☆48 Updated last year
- Code and data for the paper "Emergent Visual-Semantic Hierarchies in Image-Text Representations" (ECCV 2024) ☆26 Updated 7 months ago
- Official Pytorch implementation of "Improved Probabilistic Image-Text Representations" (ICLR 2024) ☆57 Updated 10 months ago
- [ECCV-2022] Grounding Visual Representations with Texts for Domain Generalization ☆31 Updated last year
- [CVPR 2024] Improving language-visual pretraining efficiency by performing cluster-based masking on images. ☆27 Updated 10 months ago
- ☆47 Updated 2 years ago
- [AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral) ☆39 Updated last year
- Official PyTorch implementation of the paper "CoVR: Learning Composed Video Retrieval from Web Video Captions". ☆103 Updated this week
- [NeurIPS 2022 Spotlight] RLIP: Relational Language-Image Pre-training and a series of other methods to solve HOI detection and Scene Grap… ☆73 Updated 10 months ago
- ☆12 Updated 4 months ago
- Code implementation of paper "MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval (AAAI2025)" ☆18 Updated last month
- [BMVC'21] Official PyTorch Implementation of "Grounded Situation Recognition with Transformers" ☆26 Updated 2 years ago
- ☆21 Updated 2 years ago