patrick-0817 / T-MASS-text-video-retrievalView external linksLinks
Note: DO NOT USE IT! THIS CODE IS PROVEN TO CONTAIN DATA LEAKAGE! Archive version of "Text Is MASS: Modeling as Stochastic Embedding for Text-Video Retrieval (CVPR 2024 Highlight)"
☆21May 1, 2025Updated 9 months ago
Alternatives and similar repositories for T-MASS-text-video-retrieval
Users that are interested in T-MASS-text-video-retrieval are comparing it to the libraries listed below
Sorting:
- ☆10Nov 27, 2024Updated last year
- Official github repo for ICCV2023 paper 'Multi-event Video-Text Retrieval'☆19Feb 16, 2024Updated last year
- [ICCV 2025] Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning.☆49Dec 10, 2025Updated 2 months ago
- [CVPR 2024] TeachCLIP for Text-to-Video Retrieval☆42May 7, 2025Updated 9 months ago
- Cross-Self KV Cache Pruning for Efficient Vision-Language Inference☆10Dec 15, 2024Updated last year
- ☆10May 4, 2018Updated 7 years ago
- Code for "Weakly-supervised Fingerspelling Recognition in British Sign Language Videos", BMVC 2022.☆12Jun 22, 2023Updated 2 years ago
- [NAACL 2025🔥] MEDA: Dynamic KV Cache Allocation for Efficient Multimodal Long-Context Inference☆17Jun 19, 2025Updated 7 months ago
- ☆19May 6, 2024Updated last year
- A pytorch implementation of our paper Image Captioning with Inherent Sentiment (ICME 2021 Oral).☆11Jul 18, 2022Updated 3 years ago
- official code for "3D Question Answering via only 2D Vision-Language Models"☆23Jan 15, 2026Updated last month
- [CVPR 2025] Official Pytorch implementation of "Learning with Noisy Triplet Correspondence for Composed Image Retrieval".☆22Jun 9, 2025Updated 8 months ago
- [EMNLP 2024] IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning☆15May 13, 2025Updated 9 months ago
- ✨ Official code for our paper: "Uncertainty-o: One Model-agnostic Framework for Unveiling Epistemic Uncertainty in Large Multimodal Model…☆18Mar 13, 2025Updated 11 months ago
- [ICLR 2025] Data-Augmented Phrase-Level Alignment for Mitigating Object Hallucination☆19Jan 27, 2025Updated last year
- Source code of our TMM 2019 paper "Multi-pathway Generative Adversarial Hashing for Unsupervised Cross-modal Retrieval"☆12Jun 17, 2019Updated 6 years ago
- 力扣题单hot100的ACM模式实现☆21Sep 2, 2025Updated 5 months ago
- Official GitHub repository for the paper "Adversarial Attacks on Robotic Vision Language Action Models"☆29May 28, 2025Updated 8 months ago
- [ICCV 2023] DiffusionRet: Generative Text-Video Retrieval with Diffusion Model☆140Apr 9, 2024Updated last year
- [SIGIR'2024 Best Paper Honorable Mention] Official repository for "LDRE: LLM-based Divergent Reasoning and Ensemble for Zero-Shot Compose…☆72Mar 14, 2025Updated 11 months ago
- [EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality☆21Oct 8, 2024Updated last year
- Official code for "Federated Weakly Supervised Video Anomaly Detection with Multimodal Prompt" (AAAI2025)☆25May 27, 2025Updated 8 months ago
- [ICCV 2023] Simple Baselines for Interactive Video Retrieval with Questions and Answers☆18Apr 16, 2024Updated last year
- Official resource for paper Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models (ACL 20…☆15Aug 12, 2024Updated last year
- [AAAI-26] Are We on the Right Way for Assessing Document Retrieval-Augmented Generation?☆26Dec 14, 2025Updated 2 months ago
- implementation of BiGAN model using tensorflow☆16Oct 2, 2018Updated 7 years ago
- ☆21Nov 27, 2025Updated 2 months ago
- ☆21Oct 9, 2025Updated 4 months ago
- Code repository for CVPR2024 paper 《Pre-trained Model Guided Fine-Tuning for Zero-Shot Adversarial Robustness》☆25May 29, 2024Updated last year
- ☆20Jul 1, 2025Updated 7 months ago
- [ECCV 2024 (Oral)] Towards Scene Graph Anticipation☆19Nov 25, 2024Updated last year
- [AAAI 2024] GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval☆20May 10, 2024Updated last year
- ☆64Oct 12, 2025Updated 4 months ago
- Official PyTorch Implementation for the "What if...?: Thinking Counterfactual Keywords Helps to Mitigate Hallucination in Large Multi-mod…☆19Sep 26, 2024Updated last year
- Official PyTorch implementation Source code for Weakly Supervised Video Scene Graph Generation via Natural Language Supervision, accepted…☆23Jun 13, 2025Updated 8 months ago
- Code implementation for the paper "Large-scale Pre-training for Grounded Video Caption Generation" (ICCV 2025)☆28Jan 18, 2026Updated 3 weeks ago
- ICCV'23 Dual Learning with Dynamic Knowledge Distillation for Partially Relevant Video Retrieval☆19Aug 22, 2025Updated 5 months ago
- Chest X-Ray Explainer (ChEX)☆23Jan 30, 2025Updated last year
- PyTorch Implementation for CoKe☆15Apr 6, 2022Updated 3 years ago