[CVPR 2023] VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval
☆38Feb 28, 2023Updated 3 years ago
Alternatives and similar repositories for VoP
Users who are interested in VoP are comparing it to the repositories listed below
- ☆14Aug 28, 2024Updated last year
- PyTorch implementation of the AAAI-21 paper "Dual Adversarial Label-aware Graph Neural Networks for Cross-modal Retrieval" and the TPAMI-…☆41Nov 1, 2022Updated 3 years ago
- 😎 All your need for future is FollowGPT.☆13Nov 8, 2023Updated 2 years ago
- Graph Convolutional Network Hashing for Cross-Modal Retrieval, IJCAI2019☆13Mar 14, 2021Updated 4 years ago
- This is the implementation for the paper "Generalized Semantic Preserving Hashing for N-Label Cross-Modal Retrieval"☆14Dec 7, 2017Updated 8 years ago
- This repository contains the author's implementation in PyTorch for the paper "Adaptive Label-aware Graph Convolutional Networks for Cros…☆15Dec 6, 2021Updated 4 years ago
- ☆15Oct 17, 2022Updated 3 years ago
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Jul 6, 2024Updated last year
- The top conferences on video retrieval libraries in recent years, synchronized with my blog.☆14Nov 27, 2021Updated 4 years ago
- PyTorch code for "Unified Coarse-to-Fine Alignment for Video-Text Retrieval" (ICCV 2023)☆66Jun 7, 2024Updated last year
- Graduation project: "Design and Implementation of Video-Text Retrieval Based on the CLIP Model"☆18Jul 21, 2024Updated last year
- Video embeddings for retrieval with natural language queries☆342Feb 15, 2023Updated 3 years ago
- Implementation of our AAAI2022 paper, Show Your Faith: Cross-Modal Confidence-Aware Network for Image-Text Matching.☆36Jun 16, 2023Updated 2 years ago
- [ICCV2023] Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer☆37Oct 18, 2023Updated 2 years ago
- A curated list of deep learning resources for video-text retrieval.☆645Oct 20, 2023Updated 2 years ago
- Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query (ICCV2021)☆20Dec 4, 2021Updated 4 years ago
- The source code of "Bit-aware Semantic Transformer Hashing for Multi-modal Retrieval." (Accepted by SIGIR 2022)☆18Sep 15, 2022Updated 3 years ago
- ☆49Nov 12, 2022Updated 3 years ago
- ☆82Nov 6, 2023Updated 2 years ago
- Cross-Modal-Real-valuded-Retrieval☆86Jul 18, 2023Updated 2 years ago
- This repository contains the official implementation of CoMix (NeurIPS 2021) https://arxiv.org/pdf/2110.15128.pdf.☆22Jan 12, 2022Updated 4 years ago
- An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"☆1,026Apr 12, 2024Updated last year
- Official implementation of PCS in the paper "Prompt Vision Transformer for Domain Generalization"☆50Jan 29, 2023Updated 3 years ago
- The source code for the CVPR2020 paper "Creating Something from Nothing: Unsupervised Knowledge Distillation for Cross-Modal Hashing".☆24Oct 10, 2020Updated 5 years ago
- Code and benchmarks for the Semantic Video Retrieval Task☆53Oct 18, 2022Updated 3 years ago
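- An intelligent academic research assistance platform built on the LangChain ecosystem and hybrid retrieval techniques, designed to provide efficient, accurate in-depth literature analysis. The system supports uploading academic documents in PDF/TXT/DOCX formats and, through a hybrid retrieval strategy that fuses the BGE-Small-ZH vector embedding model with BM25 keyword retrieval, achieves cross-document semantic association…☆17Aug 28, 2025Updated 6 months ago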
- ☆14Jun 19, 2024Updated last year
- BiC-Net: Learning Efficient Spatio-Temporal Relation for Text-Video Retrieval☆26Jul 22, 2022Updated 3 years ago
- SUPERVAIZER is a toolkit built for the age of AI interoperability. At its core, it implements Google's Agent-to-Agent (A2A) protocol, ena…☆14Feb 4, 2026Updated last month
- Cross Modal Retrieval with Querybank Normalisation☆57Nov 21, 2023Updated 2 years ago
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆25Nov 23, 2024Updated last year
- ☆259Dec 10, 2022Updated 3 years ago
- Extended COCO Validation (ECCV) Caption dataset (ECCV 2022)☆56Mar 1, 2024Updated 2 years ago
- [TIP2023] The code of “Plug-and-Play Regulators for Image-Text Matching”☆34Apr 11, 2024Updated last year
- [AAAI'25, CVPRW 2024] Official repository of paper titled "Learning to Prompt with Text Only Supervision for Vision-Language Models".☆126Dec 17, 2024Updated last year
- Benchmark data for "Rethinking Benchmarks for Cross-modal Image-text Retrieval" (SIGIR 2023)☆27Apr 24, 2023Updated 2 years ago
- [ICCV 2023 & AAAI 2023] Binary Adapters & FacT, [Tech report] Convpass☆198Aug 1, 2023Updated 2 years ago
- OPSTL: Self-supervised Skeleton-based Action Recognition in Occluded Environments☆14Oct 25, 2023Updated 2 years ago
- Structured TRIZ prompt engineering for LLMs in an open, portable XML format – MIT licensed.☆16Nov 11, 2025Updated 3 months ago