Note: DO NOT USE IT! THIS CODE IS PROVEN TO CONTAIN DATA LEAKAGE! Archive version of "Text Is MASS: Modeling as Stochastic Embedding for Text-Video Retrieval (CVPR 2024 Highlight)"
☆23May 1, 2025Updated 11 months ago
Alternatives and similar repositories for T-MASS-text-video-retrieval
Users that are interested in T-MASS-text-video-retrieval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Nov 27, 2024Updated last year
- [ICCV 2025] Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning.☆53Mar 17, 2026Updated last month
- Official github repo for ICCV2023 paper 'Multi-event Video-Text Retrieval'☆20Feb 16, 2024Updated 2 years ago
- official code for "3D Question Answering via only 2D Vision-Language Models"☆24Mar 4, 2026Updated last month
- [EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality☆22Oct 8, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆18May 6, 2024Updated last year
- Parameter-efficient Fine Tuning for Clinical LLMs☆17Apr 23, 2024Updated last year
- [CVPR 2025] Official Pytorch implementation of "Learning with Noisy Triplet Correspondence for Composed Image Retrieval".☆24Jun 9, 2025Updated 10 months ago
- [ICCV 2023] DiffusionRet: Generative Text-Video Retrieval with Diffusion Model☆140Apr 9, 2024Updated 2 years ago
- [EMNLP 2024] IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning☆15May 13, 2025Updated 11 months ago
- Dictionary of pairs of Korean word and IPA crawled from Wiktionary (Korean edition)☆23Nov 12, 2025Updated 5 months ago
- Robot Agent using VLLMs to make long horizon plans. Isaac Sim, BEHAVIOR, Robosuite☆13Aug 31, 2025Updated 7 months ago
- [SIGIR'2024 Best Paper Honorable Mention] Official repository for "LDRE: LLM-based Divergent Reasoning and Ensemble for Zero-Shot Compose…☆73Mar 14, 2025Updated last year
- [CVPR 2024] Official PyTorch implementation of the paper "One For All: Video Conversation is Feasible Without Video Instruction Tuning"☆35Feb 2, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆21Oct 9, 2025Updated 6 months ago
- code for paper `MemCap: Memorizing Style Knowledge for Image Captioning`☆11Mar 17, 2020Updated 6 years ago
- Code repository for CVPR2024 paper 《Pre-trained Model Guided Fine-Tuning for Zero-Shot Adversarial Robustness》☆25May 29, 2024Updated last year
- Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"☆107Jan 28, 2024Updated 2 years ago
- Official GitHub repository for the paper "Adversarial Attacks on Robotic Vision Language Action Models"☆33May 28, 2025Updated 10 months ago
- A prototype implementation in python of the paper "Cross-View Tracking for Multi-Human 3D Pose Estimation at over 100 FPS ", with inverse …☆15Apr 2, 2022Updated 4 years ago
- PyTorch Implementation for CoKe☆15Apr 6, 2022Updated 4 years ago
- ☆66Dec 29, 2025Updated 3 months ago
- [AAAI 2026 Oral] HiMo-CLIP: Modeling Semantic Hierarchy and Monotonicity in Vision-Language Alignment☆30Dec 17, 2025Updated 4 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- The papers of Deepfakes Detection.☆23Feb 3, 2021Updated 5 years ago
- Source code of our TMM 2019 paper "Multi-pathway Generative Adversarial Hashing for Unsupervised Cross-modal Retrieval"☆12Jun 17, 2019Updated 6 years ago
- [ICCV2021] Official Pytorch implementation for SDGZSL (Semantics Disentangling for Generalized Zero-Shot Learning)☆40Dec 16, 2021Updated 4 years ago
- Cross-Self KV Cache Pruning for Efficient Vision-Language Inference☆10Dec 15, 2024Updated last year
- ☆23Jul 1, 2025Updated 9 months ago
- implementation of BiGAN model using tensorflow☆16Oct 2, 2018Updated 7 years ago
- [ICML 2022] Generalizing to Evolving Domains with Latent Structure-Aware Sequential Autoencoder☆23Feb 25, 2024Updated 2 years ago
- [ICCV 2023] Simple Baselines for Interactive Video Retrieval with Questions and Answers☆19Apr 16, 2024Updated 2 years ago
- This is a self-made dataset designed for drowning detection.☆34Jul 27, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This repository contains code for the paper "Fine-Grained Predicates Learning for Scene Graph Generation (CVPR 2022)".☆26Jun 7, 2024Updated last year
- [IEEE TMM'25] Scene-Text Grounding for Text-Based Video Question Answering☆17Feb 16, 2026Updated 2 months ago
- [AAAI 2024] GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval☆20May 10, 2024Updated last year
- Released code and data for "Frame-Transformer Emotion Classification Network." ICMR 2017☆17Jun 17, 2017Updated 8 years ago
- ☆31Apr 9, 2026Updated last week
- ✨ Official code for our paper: "Uncertainty-o: One Model-agnostic Framework for Unveiling Epistemic Uncertainty in Large Multimodal Model…☆20Mar 13, 2025Updated last year
- ICCV'23 Dual Learning with Dynamic Knowledge Distillation for Partially Relevant Video Retrieval☆19Aug 22, 2025Updated 7 months ago