patrick-0817/T-MASS-text-video-retrieval

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/patrick-0817/T-MASS-text-video-retrieval)

patrick-0817 / T-MASS-text-video-retrieval

Note: DO NOT USE IT! THIS CODE IS PROVEN TO CONTAIN DATA LEAKAGE! Archive version of "Text Is MASS: Modeling as Stochastic Embedding for Text-Video Retrieval (CVPR 2024 Highlight)"

☆23

Alternatives and similar repositories for T-MASS-text-video-retrieval

Users that are interested in T-MASS-text-video-retrieval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

patrick-0817 / T-MASS-dataleakage
View on GitHub
☆10Nov 27, 2024Updated last year
JasonCodeMaker / CTVR
View on GitHub
☆16Jun 2, 2025Updated last year
invhun / NarVid
View on GitHub
[CVPR'2025] Narrating the Video: Boosting Text-Video Retrieval via Comprehensive Utilization of Frame-Level Captions
☆19Jan 16, 2026Updated 6 months ago
gengyuanmax / MeVTR
View on GitHub
Official github repo for ICCV2023 paper 'Multi-event Video-Text Retrieval'
☆20Feb 16, 2024Updated 2 years ago
jason-lim26 / DiPEx
View on GitHub
Official PyTorch implementation of our paper "Dispersing Prompt Expansion for Class-Agnostic Object Detection" (NeurIPS 2024)
☆14Jan 19, 2025Updated last year
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
li-shuxian / TME
View on GitHub
[CVPR 2025] Official Pytorch implementation of "Learning with Noisy Triplet Correspondence for Composed Image Retrieval".
☆27Jun 9, 2025Updated last year
Lilidamowang / T2VIndexer-generativeSearch
View on GitHub
☆16Aug 28, 2024Updated last year
musicman217 / Text-Proxy
View on GitHub
Text Proxy: Decomposing Retrieval from a 1-to-N Relationship into N 1-to-1 Relationships for Text-Video Retrieval -- AAAI2025
☆21May 8, 2026Updated 2 months ago
ruc-aimc-lab / TeachCLIP
View on GitHub
[CVPR 2024] TeachCLIP for Text-to-Video Retrieval
☆42May 7, 2025Updated last year
prajwalkr / transpeller
View on GitHub
Code for "Weakly-supervised Fingerspelling Recognition in British Sign Language Videos", BMVC 2022.
☆12Jun 22, 2023Updated 3 years ago
lijun2005 / ICCV25-HLFormer
View on GitHub
[ICCV 2025] Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning
☆62May 11, 2026Updated 2 months ago
echogarden-project / text-segmentation
View on GitHub
A library for multilingual word, phrase and sentence segmentation.
☆16Updated this week
CSC2548 / image_caption_gan
View on GitHub
☆10May 4, 2018Updated 8 years ago
amazon-science / slang-llm-benchmark
View on GitHub
☆19May 6, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
farewellthree / BT-Adapter
View on GitHub
[CVPR 2024] Official PyTorch implementation of the paper "One For All: Video Conversation is Feasible Without Video Instruction Tuning"
☆35Feb 2, 2024Updated 2 years ago
entalent / MemCap
View on GitHub
code for paper `MemCap: Memorizing Style Knowledge for Image Captioning`
☆11Mar 17, 2020Updated 6 years ago
boreng0817 / IFCap
View on GitHub
[EMNLP 2024] IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning
☆15May 13, 2025Updated last year
farewellthree / STAN
View on GitHub
Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"
☆107Jan 28, 2024Updated 2 years ago
Pter61 / osrcir
View on GitHub
Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval [CVPR 2025 Highlight]
☆72Jul 8, 2025Updated last year
PKU-ICST-MIPL / MGAH_TMM2019
View on GitHub
Source code of our TMM 2019 paper "Multi-pathway Generative Adversarial Hashing for Unsupervised Cross-modal Retrieval"
☆12Jun 17, 2019Updated 7 years ago
mainaksingha01 / ODG-CLIP
View on GitHub
☆21Oct 9, 2025Updated 9 months ago
uniglot / korean-word-ipa-dictionary
View on GitHub
Dictionary of pairs of Korean word and IPA crawled from Wiktionary (Korean edition)
☆23Nov 12, 2025Updated 8 months ago
bofang98 / UATVR
View on GitHub
[ICCV'23] UATVR: Uncertainty-Adaptive Text-Video Retrieval
☆13Nov 5, 2023Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
ytaek-oh / fsc-clip
View on GitHub
[EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality
☆24Oct 8, 2024Updated last year
idstcv / CoKe
View on GitHub
PyTorch Implementation for CoKe
☆15Apr 6, 2022Updated 4 years ago
mlvlab / BLiM
View on GitHub
Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval (ICCV 2025 Highlight)
☆26Aug 1, 2025Updated 11 months ago
Ziyang412 / Video-RTS
View on GitHub
Code for EMNLP25 paper "Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning"
☆24Feb 18, 2026Updated 5 months ago
Liyan06 / ChartMuseum
View on GitHub
[NeurIPS 2025] ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models
☆24Apr 20, 2026Updated 3 months ago
codezakh / DataEnvGym
View on GitHub
[ICLR 25 Spotlight] A testbed for agents and environments that can automatically improve models through data generation.
☆32Mar 4, 2025Updated last year
chenshen03 / Deepfakes-Detection-Papers
View on GitHub
The papers of Deepfakes Detection.
☆23Feb 3, 2021Updated 5 years ago
kevinliang888 / IVR-QA-baselines
View on GitHub
[ICCV 2023] Simple Baselines for Interactive Video Retrieval with Questions and Answers
☆20Apr 16, 2024Updated 2 years ago
masataka46 / BiGAN
View on GitHub
implementation of BiGAN model using tensorflow
☆16Oct 2, 2018Updated 7 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
huangmozhi9527 / GMMFormer
View on GitHub
[AAAI 2024] GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval
☆21May 10, 2024Updated 2 years ago
zhousheng97 / ViTXT-GQA
View on GitHub
[IEEE TMM'25] Scene-Text Grounding for Text-Based Video Question Answering
☆17Feb 16, 2026Updated 5 months ago
TerryPei / CSP
View on GitHub
Cross-Self KV Cache Pruning for Efficient Vision-Language Inference
☆10Dec 15, 2024Updated last year
loscheris / VideoCaptioning_att
View on GitHub
A video captioning tool using S2VT method and attention mechanism (TensorFlow)
☆15Oct 14, 2018Updated 7 years ago
vla-attack / tex3d
View on GitHub
☆19Jul 20, 2026Updated last week
art-jang / LiTFiC
View on GitHub
[CVPR2025] Official code for Lost in Translation Found in Context
☆24Jan 14, 2026Updated 6 months ago
kittenish / Frame-Transformer-Network
View on GitHub
Released code and data for "Frame-Transformer Emotion Classification Network." ICMR 2017
☆17Jun 17, 2017Updated 9 years ago