bighuang624 / VoPView external linksLinks
[CVPR 2023] VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval
☆38Feb 28, 2023Updated 2 years ago
Alternatives and similar repositories for VoP
Users that are interested in VoP are comparing it to the libraries listed below
Sorting:
- ☆13Aug 28, 2024Updated last year
- Graph Convolutional Network Hashing for Cross-Modal Retrieval, IJCAI2019☆13Mar 14, 2021Updated 4 years ago
- Label Embedding Online Hashing for Cross-Modal Retrieval☆13Sep 22, 2025Updated 4 months ago
- This is the implementation for the paper "Generalized Semantic Preserving Hashing for N-Label Cross-Modal Retrieval"☆14Dec 7, 2017Updated 8 years ago
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Jul 6, 2024Updated last year
- ☆15Oct 17, 2022Updated 3 years ago
- The top conferences on video retrieval libraries in recent years, synchronized with my blog.☆14Nov 27, 2021Updated 4 years ago
- Video embeddings for retrieval with natural language queries☆342Feb 15, 2023Updated 3 years ago
- Implementation of our AAAI2022 paper, Show Your Faith: Cross-Modal Confidence-Aware Network for Image-Text Matching.☆36Jun 16, 2023Updated 2 years ago
- A curated list of deep learning resources for video-text retrieval.☆642Oct 20, 2023Updated 2 years ago
- [ICCV2023] Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer☆37Oct 18, 2023Updated 2 years ago
- The Demo of Our CVPR paper "Cross-Modality Binary Code Learning via Fusion Similarity Hashing"☆14Sep 7, 2017Updated 8 years ago
- Evaluate robustness of adaptation methods on large vision-language models☆19Aug 23, 2023Updated 2 years ago
- The source code of "Bit-aware Semantic Transformer Hashing for Multi-modal Retrieval." (Accepted by SIGIR 2022)☆18Sep 15, 2022Updated 3 years ago
- Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query (ICCV2021)☆20Dec 4, 2021Updated 4 years ago
- ☆49Nov 12, 2022Updated 3 years ago
- On-Device Domain Generalization☆46Nov 9, 2022Updated 3 years ago
- ☆80Nov 6, 2023Updated 2 years ago
- Cross-Modal-Real-valuded-Retrieval☆87Jul 18, 2023Updated 2 years ago
- This repository contains the official implementation of CoMix (NeurIPS 2021) https://arxiv.org/pdf/2110.15128.pdf.☆22Jan 12, 2022Updated 4 years ago
- An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"☆1,023Apr 12, 2024Updated last year
- Official implementation of PCS in essay "Prompt Vision Transformer for Domain Generalization"☆50Jan 29, 2023Updated 3 years ago
- The source code for the CVPR2020 paper "Creating Something from Nothing: Unsupervised Knowledge Distillation for Cross-Modal Hashing".☆24Oct 10, 2020Updated 5 years ago
- Code and benchmarks for the Semantic Video Retrieval Task☆53Oct 18, 2022Updated 3 years ago
- SUPERVAIZER is a toolkit built for the age of AI interoperability. At its core, it implements Google's Agent-to-Agent (A2A) protocol, ena…☆14Feb 4, 2026Updated last week
- ☆26Mar 20, 2023Updated 2 years ago
- BiC-Net: Learning Efficient Spatio-Temporal Relation for Text-Video Retrieval☆26Jul 22, 2022Updated 3 years ago
- Cross Modal Retrieval with Querybank Normalisation☆57Nov 21, 2023Updated 2 years ago
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆25Nov 23, 2024Updated last year
- ☆259Dec 10, 2022Updated 3 years ago
- Extended COCO Validation (ECCV) Caption dataset (ECCV 2022)☆56Mar 1, 2024Updated last year
- [AAAI'25, CVPRW 2024] Official repository of paper titled "Learning to Prompt with Text Only Supervision for Vision-Language Models".☆121Dec 17, 2024Updated last year
- [ICCV 2023 & AAAI 2023] Binary Adapters & FacT, [Tech report] Convpass☆198Aug 1, 2023Updated 2 years ago
- Structured TRIZ prompt engineering for LLMs in an open, portable XML format – MIT licensed.☆14Nov 11, 2025Updated 3 months ago
- OPSTL: Self-supervised Skeleton-based Action Recognition in Occluded Environments☆14Oct 25, 2023Updated 2 years ago
- USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text Retrieval, TIP 2024☆33Jun 18, 2025Updated 7 months ago
- ☆26Jan 12, 2022Updated 4 years ago
- Source code of our TCYB 2018 paper "SCH-GAN: Semi-supervised Cross-modal Hashing by Generative Adversarial Network"☆27Aug 31, 2018Updated 7 years ago
- [CVPR 2023] Official repository of paper titled "Fine-tuned CLIP models are efficient video learners".☆305Apr 3, 2024Updated last year