CryhanFang/CLIP2Video

☆260

Alternatives and similar repositories for CLIP2Video

Users that are interested in CLIP2Video are comparing it to the libraries listed below.

ArrowLuo / CLIP4Clip
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
☆1,028 · Apr 12, 2024 · Updated 2 years ago
m-bain / frozen-in-time
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]
☆377 · May 19, 2022 · Updated 4 years ago
Deferf / CLIP_Video_Representation
Use CLIP to represent video for Retrieval Task
☆71 · Mar 1, 2021 · Updated 5 years ago
starmemda / CAMoE
☆101 · Sep 27, 2021 · Updated 4 years ago
jayleicn / ClipBERT
[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning…
☆730 · Aug 8, 2023 · Updated 2 years ago
danieljf24 / awesome-video-text-retrieval
A curated list of deep learning resources for video-text retrieval.
☆645 · Oct 20, 2023 · Updated 2 years ago
albanie / collaborative-experts
Video embeddings for retrieval with natural language queries
☆344 · Feb 15, 2023 · Updated 3 years ago
foolwood / DRL
[arXiv22] Disentangled Representation Learning for Text-Video Retrieval
☆97 · Apr 7, 2022 · Updated 4 years ago
mzhaoshuai / CenterCLIP
[SIGIR 2022] CenterCLIP: Token Clustering for Efficient Text-Video Retrieval.
☆135 · May 4, 2022 · Updated 4 years ago
gabeur / mmt
Multi-Modal Transformer for Video Retrieval
☆265 · Oct 9, 2024 · Updated last year
mwray / Semantic-Video-Retrieval
Code and benchmarks for the Semantic Video Retrieval Task
☆53 · Oct 18, 2022 · Updated 3 years ago
layer6ai-labs / xpool
https://layer6ai-labs.github.io/xpool/
☆138 · Jul 1, 2023 · Updated 3 years ago
yawenzeng / Awesome-Cross-Modal-Video-Moment-Retrieval
Continuously updated list of cutting-edge papers on video moment localization, temporal language grounding, and video clip retrieval.
☆265 · Aug 26, 2023 · Updated 2 years ago
VALUE-Leaderboard / StarterCode
Starter Code for VALUE benchmark
☆79 · Aug 23, 2022 · Updated 3 years ago
cshizhe / hgr_v2t
Code accompanying the paper "Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning".
☆211 · Jun 12, 2020 · Updated 6 years ago
danieljf24 / hybrid_space
Source code of our TPAMI'21 paper Dual Encoding for Video Retrieval by Text and CVPR'19 paper Dual Encoding for Zero-Example Video Retrie…
☆88 · Jan 10, 2023 · Updated 3 years ago
jayleicn / singularity
[ACL 2023] Official PyTorch code for Singularity model in "Revealing Single Frame Bias for Video-and-Language Learning"
☆136 · May 5, 2023 · Updated 3 years ago
TencentARC / MCQ
Official code for "Bridging Video-text Retrieval with Multiple Choice Questions", CVPR 2022 (Oral).
☆141 · Jul 20, 2022 · Updated 4 years ago
sallymmx / ActionCLIP
This is the official implementation of the paper "ActionCLIP: A New Paradigm for Action Recognition"
☆613 · Dec 6, 2023 · Updated 2 years ago
li-xirong / avs
Ad-hoc Video Search
☆29 · Feb 18, 2021 · Updated 5 years ago
ioanacroi / qb-norm
Cross Modal Retrieval with Querybank Normalisation
☆57 · Nov 21, 2023 · Updated 2 years ago
xuguohai / X-CLIP
An official implementation for "X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval"
☆185 · Apr 6, 2024 · Updated 2 years ago
HuiGuanLab / ms-sl
Source code of our MM'22 paper Partially Relevant Video Retrieval
☆57 · Nov 4, 2024 · Updated last year
linjieli222 / HERO
Research code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"
☆235 · Sep 16, 2021 · Updated 4 years ago
microsoft / UniVL
An official implementation for "UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"
☆365 · Jul 25, 2024 · Updated last year
LiuRicky / ts2_net
[ECCV 2022] A pytorch implementation for TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval
☆80 · Nov 29, 2022 · Updated 3 years ago
danieljf24 / dual_encoding
[CVPR2019] Dual Encoding for Zero-Example Video Retrieval
☆153 · Jan 10, 2023 · Updated 3 years ago
papermsucode / mdmmt
MDMMT: Multidomain Multimodal Transformer for Video Retrieval
☆26 · Jun 28, 2021 · Updated 5 years ago
minghangz / cpl
CPL: Weakly Supervised Temporal Sentence Grounding with Gaussian-based Contrastive Proposal Learning
☆65 · Mar 22, 2026 · Updated 3 months ago
Roc-Ng / HANet
PyTorch implementation of HANet: Hierarchical Alignment Networks for Video-Text Retrieval (ACM MM 2021).
☆47 · Aug 19, 2021 · Updated 4 years ago
princetonvisualai / MQVR
☆26 · Jan 12, 2022 · Updated 4 years ago
amazon-science / video-contrastive-learning
Video Contrastive Learning with Global Context, ICCVW 2021
☆162 · May 30, 2022 · Updated 4 years ago
UKPLab / MMT-Retrieval
☆131 · Dec 10, 2022 · Updated 3 years ago
salesforce / ALPRO
Align and Prompt: Video-and-Language Pre-training with Entity Prompts
☆188 · May 1, 2025 · Updated last year
FingerRec / OA-Transformer
[CVPR 2022] The code for our paper "Object-aware Video-language Pre-training for Retrieval"
☆61 · May 25, 2022 · Updated 4 years ago
jayleicn / mTVRetrieval
[ACL 2021] mTVR: Multilingual Video Moment Retrieval
☆27 · Aug 20, 2022 · Updated 3 years ago
simon-ging / coot-videotext
COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning
☆291 · Sep 6, 2022 · Updated 3 years ago
mengcaopku / LocVTP
[ECCV 22] LocVTP: Video-Text Pre-training for Temporal Localization
☆39 · Jul 29, 2022 · Updated 3 years ago
salesforce / ALBEF
Code for ALBEF: a new vision-language pre-training method
☆1,757 · Sep 20, 2022 · Updated 3 years ago