ruc-aimc-lab/TeachCLIP

[CVPR 2024] TeachCLIP for Text-to-Video Retrieval

☆42

Alternatives and similar repositories for TeachCLIP

Users who are interested in TeachCLIP are comparing it to the repositories listed below.

adxcreative / D-M
The official source code of our AAAI25 paper "D&M: Enriching E-commerce Videos with Sound Effects by Key Moment Detection and SFX Matchin…
☆10 · Feb 9, 2025 · Updated last year
patrick-0817 / T-MASS-dataleakage
☆10 · Nov 27, 2024 · Updated last year
xxayt / MGSV
[ICCV 2025] This repo is the official implementation of "Music Grounding by Short Video"
☆27 · Sep 9, 2025 · Updated 10 months ago
Ziyang412 / UCoFiA
Pytorch Code for "Unified Coarse-to-Fine Alignment for Video-Text Retrieval" (ICCV 2023)
☆66 · Jun 7, 2024 · Updated 2 years ago
xuguohai / X-CLIP
An official implementation for "X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval"
☆185 · Apr 6, 2024 · Updated 2 years ago
bladewaltz1 / PromptSwitch
☆30 · Aug 14, 2023 · Updated 2 years ago
layer6ai-labs / xpool
https://layer6ai-labs.github.io/xpool/
☆137 · Jul 1, 2023 · Updated 3 years ago
whwu95 / Cap4Video
【CVPR'2023 Highlight & TPAMI】Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?
☆256 · Nov 29, 2024 · Updated last year
musicman217 / Text-Proxy
Text Proxy: Decomposing Retrieval from a 1-to-N Relationship into N 1-to-1 Relationships for Text-Video Retrieval -- AAAI2025
☆21 · May 8, 2026 · Updated 2 months ago
jiazhen-code / PhD
[CVPR25 Highlight] A ChatGPT-Prompted Visual hallucination Evaluation Dataset, featuring over 100,000 data samples and four advanced eval…
☆32 · Apr 16, 2025 · Updated last year
OmkarThawakar / composed-video-retrieval
Composed Video Retrieval
☆62 · May 2, 2024 · Updated 2 years ago
LunarShen / TempMe
[ICLR 2025] TempMe: Video Temporal Token Merging for Efficient Text-Video Retrieval
☆27 · Feb 13, 2025 · Updated last year
tuyunbin / Review-of-Change-Captioning
This repository offers a comprehensive overview of existing datasets and methods in the field of change captioning.
☆17 · Sep 2, 2025 · Updated 10 months ago
foolwood / DRL
[arXiv22] Disentangled Representation Learning for Text-Video Retrieval
☆96 · Apr 7, 2022 · Updated 4 years ago
leolee99 / PAU
[NeurIPS 2023] The official implementation of paper "Prototype-based Aleatoric Uncertainty Quantification for Cross-modal Retrieval" acce…
☆28 · May 14, 2024 · Updated 2 years ago
LunarShen / DsicoVLA
[CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval
☆22 · Jun 23, 2025 · Updated last year
LeeHyuck / CDMAD
Code of the CDMAD: Class-Distribution-Mismatch-Aware Debiasing for Class-Imbalanced Semi-Supervised Learning (2024 CVPR accepted paper)
☆21 · Mar 18, 2024 · Updated 2 years ago
sudo-Boris / mr-Blip
Official Implementation of "Chrono: A Simple Blueprint for Representing Time in MLLMs"
☆95 · Mar 9, 2025 · Updated last year
hrtang22 / MUSE
Code implementation of paper "MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval (AAAI2025)"
☆26 · Feb 2, 2025 · Updated last year
zyxia1009 / CVPR2024-TSPNet
(CVPR2024) Realigning Confidence with Temporal Saliency Information for Point-level Weakly-Supervised Temporal Action Localization
☆20 · Jun 11, 2024 · Updated 2 years ago
1180300419 / imperfect-deweathering
[AAAI 2024] Official pytorch implementation of “Learning Real-World Image De-Weathering with Imperfect Supervision”
☆17 · Aug 22, 2024 · Updated last year
Mehrdad-Noori / WATT
[NeurIPS 2024] WATT: Weight Average Test-Time Adaptation of CLIP
☆58 · Sep 26, 2024 · Updated last year
jinhyunj / EaTR
Official pytorch repository for "Knowing Where to Focus: Event-aware Transformer for Video Grounding" (ICCV 2023)
☆55 · Sep 7, 2023 · Updated 2 years ago
HuiGuanLab / DL-DKD
Source code of the paper Dual Learning with Dynamic Knowledge Distillation and Soft Alignment for Partially Relevant Video Retrieval
☆19 · May 13, 2026 · Updated 2 months ago
patrick-0817 / T-MASS-text-video-retrieval
Note: DO NOT USE IT! THIS CODE IS PROVEN TO CONTAIN DATA LEAKAGE! Archive version of "Text Is MASS: Modeling as Stochastic Embedding for …
☆23 · May 1, 2025 · Updated last year
knightyxp / DGL
[AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.
☆49 · Oct 14, 2024 · Updated last year
kevinliang888 / IVR-QA-baselines
[ICCV 2023] Simple Baselines for Interactive Video Retrieval with Questions and Answers
☆20 · Apr 16, 2024 · Updated 2 years ago
maifoundations / Streamo
Streaming Video Instruction Tuning
☆82 · Feb 25, 2026 · Updated 5 months ago
naver-ai / pcmepp
Official Pytorch implementation of "Improved Probabilistic Image-Text Representations" (ICLR 2024)
☆64 · May 26, 2024 · Updated 2 years ago
farewellthree / STAN
Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"
☆107 · Jan 28, 2024 · Updated 2 years ago
HKUST-LongGroup / DyME
[ICLR 2026] Empowering Small VLMs to Think with Dynamic Memorization and Exploration
☆18 · Mar 18, 2026 · Updated 4 months ago
Hritikbansal / videocon
☆58 · Apr 24, 2024 · Updated 2 years ago
adxcreative / COPE
☆15 · Dec 20, 2024 · Updated last year
Jam1ezhang / RankCLIP
Ranking-Consistent Language-Image Pretraining
☆15 · Oct 24, 2025 · Updated 9 months ago
jpthu17 / DiffusionRet
[ICCV 2023] DiffusionRet: Generative Text-Video Retrieval with Diffusion Model
☆141 · Apr 9, 2024 · Updated 2 years ago
TheEighthDay / SeekWorld
The first attempt to replicate o3-like visual clue-tracking reasoning capabilities.
☆64 · Jul 8, 2025 · Updated last year
zhangbw17 / MV-Adapter
An official pytorch implementation of the paper: [MV-Adapter: Multimodal Video Transfer Learning for Video Text Retrieval].
☆14 · Jul 27, 2024 · Updated last year
dreamhunter2333 / ikun-whacamole
Whac-A-Mole, "鸡你太美" (ikun) edition
☆10 · Dec 16, 2022 · Updated 3 years ago
yhy-2000 / MomentSeeker
☆23 · Jul 23, 2025 · Updated last year