bighuang624/VoP

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/bighuang624/VoP)

bighuang624 / VoP

[CVPR 2023] VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval

☆38

Alternatives and similar repositories for VoP

Users that are interested in VoP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Lilidamowang / T2VIndexer-generativeSearch
View on GitHub
☆16Aug 28, 2024Updated last year
CrossmodalGroup / CMCAN
View on GitHub
Implementation of our AAAI2022 paper, Show Your Faith: Cross-Modal Confidence-Aware Network for Image-Text Matching.
☆36Jun 16, 2023Updated 3 years ago
LgQu / TIGeR
View on GitHub
Code for paper: Unified Text-to-Image Generation and Retrieval
☆16Jul 19, 2026Updated last week
LivXue / GNN4CMR
View on GitHub
PyTorch implementation of the AAAI-21 paper "Dual Adversarial Label-aware Graph Neural Networks for Cross-modal Retrieval" and the TPAMI-…
☆42Nov 1, 2022Updated 3 years ago
DeXie0808 / GCH
View on GitHub
Graph Convolutional Network Hashing for Cross-Modal Retrieval, IJCAI2019
☆13Mar 14, 2021Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Ziyang412 / UCoFiA
View on GitHub
Pytorch Code for "Unified Coarse-to-Fine Alignment for Video-Text Retrieval" (ICCV 2023)
☆66Jun 7, 2024Updated 2 years ago
liuxiaolei88 / Awesome-Text2Video-Retrieval
View on GitHub
The top conferences on video retrieval libraries in recent years, synchronized with my blog.
☆14Nov 27, 2021Updated 4 years ago
HuangYuantong / video-text-retrieval
View on GitHub
毕业设计：《基于CLIP模型的视频文本检索设计与实现》
☆18Jul 21, 2024Updated 2 years ago
albanie / collaborative-experts
View on GitHub
Video embeddings for retrieval with natural language queries
☆344Feb 15, 2023Updated 3 years ago
danieljf24 / awesome-video-text-retrieval
View on GitHub
A curated list of deep learning resources for video-text retrieval.
☆644Oct 20, 2023Updated 2 years ago
yxinwang / LEMON-MM2020
View on GitHub
Label Embedding Online Hashing for Cross-Modal Retrieval
☆13Sep 22, 2025Updated 10 months ago
park-jungin / DualPath
View on GitHub
☆49Nov 12, 2022Updated 3 years ago
FutureTwT / BSTH
View on GitHub
The source code of "Bit-aware Semantic Transformer Hashing for Multi-modal Retrieval." (Accepted by SIGIR 2022)
☆18Sep 15, 2022Updated 3 years ago
KaiyangZhou / on-device-dg
View on GitHub
On-Device Domain Generalization
☆47Nov 9, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
LivXue / ALGCN
View on GitHub
This repository contains the author's implementation in PyTorch for the paper "Adaptive Label-aware Graph Convolutional Networks for Cros…
☆15Dec 6, 2021Updated 4 years ago
Xiaodongsuper / Entity-Graph-Enhanced-Cross-Modal-Pretraining-for-Instance-level-Product-Retrieval
View on GitHub
☆15Oct 17, 2022Updated 3 years ago
liuting20 / MaPPER
View on GitHub
[EMNLP 2024 Main] MaPPER: Multimodal Prior-guided Parameter Efficient Tuning for Referring Expression Comprehension
☆16Jan 6, 2025Updated last year
lionel-hing / BiC-Net
View on GitHub
BiC-Net: Learning Efficient Spatio-Temporal Relation for Text-Video Retrieval
☆28Jul 22, 2022Updated 4 years ago
Lyn-L / FSH
View on GitHub
The Demo of Our CVPR paper "Cross-Modality Binary Code Learning via Fusion Similarity Hashing"
☆14Sep 7, 2017Updated 8 years ago
ArrowLuo / CLIP4Clip
View on GitHub
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
☆1,030Apr 12, 2024Updated 2 years ago
XLiu443 / Tem-adapter
View on GitHub
[ICCV2023] Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer
☆37Oct 18, 2023Updated 2 years ago
CVIR / CoMix
View on GitHub
This repository contains the official implementation of CoMix (NeurIPS 2021) https://arxiv.org/pdf/2110.15128.pdf.
☆22Jan 12, 2022Updated 4 years ago
adarobustness / adaptation_robustness
View on GitHub
Evaluate robustness of adaptation methods on large vision-language models
☆19Aug 23, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
BMC-SDNU / Cross-Modal-Retrieval
View on GitHub
Cross-Modal-Real-valuded-Retrieval
☆88Jul 18, 2023Updated 3 years ago
mwray / Semantic-Video-Retrieval
View on GitHub
Code and benchmarks for the Semantic Video Retrieval Task
☆53Oct 18, 2022Updated 3 years ago
anakin-skywalker-Joseph / Folder
View on GitHub
Official Implementation of Paper FOLDER (ICCV2025) and Turbo (ECCV2024)
☆15Jun 27, 2025Updated last year
CuthbertCai / Ask-Confirm
View on GitHub
Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query (ICCV2021)
☆20Dec 4, 2021Updated 4 years ago
williamium3000 / awesome-mllm-grounding
View on GitHub
Awesome paper for multi-modal llm with grounding ability
☆21Oct 11, 2025Updated 9 months ago
aysebilgegunduz / ShotBoundaryDetection
View on GitHub
Detects shot boundaries from news with K-Means. Using Bhattacharya Coefficient for distance.
☆10Jun 1, 2017Updated 9 years ago
huhengtong / UKD_CVPR2020
View on GitHub
The source code for the CVPR2020 paper "Creating Something from Nothing: Unsupervised Knowledge Distillation for Cross-Modal Hashing".
☆24Oct 10, 2020Updated 5 years ago
fhlt / shot_boundary_detection
View on GitHub
shot_boundary_detection
☆10Nov 26, 2019Updated 6 years ago
liuting20 / DARA
View on GitHub
[ICME 2024 Oral] DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding
☆22Feb 26, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
cwj1412 / MSCOCO-Flikcr30K_FG
View on GitHub
Benchmark data for "Rethinking Benchmarks for Cross-modal Image-text Retrieval" (SIGIR 2023)
☆28Apr 24, 2023Updated 3 years ago
princetonvisualai / MQVR
View on GitHub
☆26Jan 12, 2022Updated 4 years ago
XLearning-SCU / 2021-NeurIPS-NCR
View on GitHub
☆82Nov 6, 2023Updated 2 years ago
zhengzangw / DoPrompt
View on GitHub
Official implementation of PCS in essay "Prompt Vision Transformer for Domain Generalization"
☆48Jan 29, 2023Updated 3 years ago
ioanacroi / qb-norm
View on GitHub
Cross Modal Retrieval with Querybank Normalisation
☆57Nov 21, 2023Updated 2 years ago
gimpong / WWW22-HCQ
View on GitHub
The code for the paper "Hybrid Contrastive Quantization for Efficient Cross-View Video Retrieval" (WWW'22, Oral).
☆17Mar 8, 2022Updated 4 years ago
zhangy0822 / USER
View on GitHub
USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text Retrieval, TIP 2024
☆33Jun 18, 2025Updated last year