Note: DO NOT USE IT! THIS CODE IS PROVEN TO CONTAIN DATA LEAKAGE! Archive version of "Text Is MASS: Modeling as Stochastic Embedding for Text-Video Retrieval (CVPR 2024 Highlight)"
☆23 · May 1, 2025 · Updated last year
Alternatives and similar repositories for T-MASS-text-video-retrieval
Users that are interested in T-MASS-text-video-retrieval are comparing it to the libraries listed below.
- [ICCV 2025] Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning ☆60 · May 11, 2026 · Updated last month
- Official github repo for ICCV2023 paper 'Multi-event Video-Text Retrieval' ☆20 · Feb 16, 2024 · Updated 2 years ago
- [CVPR 2024] TeachCLIP for Text-to-Video Retrieval ☆42 · May 7, 2025 · Updated last year
- Code for "Weakly-supervised Fingerspelling Recognition in British Sign Language Videos", BMVC 2022. ☆12 · Jun 22, 2023 · Updated 2 years ago
- official code for "3D Question Answering via only 2D Vision-Language Models" ☆24 · Mar 4, 2026 · Updated 3 months ago
- [EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality ☆22 · Oct 8, 2024 · Updated last year
- ☆19 · May 6, 2024 · Updated 2 years ago
- ☆10 · May 4, 2018 · Updated 8 years ago
- [EMNLP 2024] IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning ☆14 · May 13, 2025 · Updated last year
- Dictionary of pairs of Korean word and IPA crawled from Wiktionary (Korean edition) ☆23 · Nov 12, 2025 · Updated 7 months ago
- [SIGIR'2024 Best Paper Honorable Mention] Official repository for "LDRE: LLM-based Divergent Reasoning and Ensemble for Zero-Shot Compose… ☆74 · Mar 14, 2025 · Updated last year
- A pytorch implementation of our paper Image Captioning with Inherent Sentiment (ICME 2021 Oral). ☆11 · Jul 18, 2022 · Updated 3 years ago
- [CVPR 2024] Official PyTorch implementation of the paper "One For All: Video Conversation is Feasible Without Video Instruction Tuning" ☆35 · Feb 2, 2024 · Updated 2 years ago
- ☆21 · Oct 9, 2025 · Updated 8 months ago
- code for paper `MemCap: Memorizing Style Knowledge for Image Captioning` ☆11 · Mar 17, 2020 · Updated 6 years ago
- Code repository for CVPR2024 paper 《Pre-trained Model Guided Fine-Tuning for Zero-Shot Adversarial Robustness》 ☆25 · May 29, 2024 · Updated 2 years ago
- Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring" ☆107 · Jan 28, 2024 · Updated 2 years ago
- Official GitHub repository for the paper "Adversarial Attacks on Robotic Vision Language Action Models" ☆35 · May 28, 2025 · Updated last year
- A prototype implementation in python of the paper "Cross-View Tracking for Multi-Human 3D Pose Estimation at over 100 FPS ", with inverse… ☆15 · Apr 2, 2022 · Updated 4 years ago
- ☆68 · Dec 29, 2025 · Updated 5 months ago
- [AAAI 2026 Oral] HiMo-CLIP: Modeling Semantic Hierarchy and Monotonicity in Vision-Language Alignment ☆29 · Dec 17, 2025 · Updated 6 months ago
- The papers of Deepfakes Detection. ☆23 · Feb 3, 2021 · Updated 5 years ago
- [ICCV'23] UATVR: Uncertainty-Adaptive Text-Video Retrieval ☆13 · Nov 5, 2023 · Updated 2 years ago
- Source code of our TMM 2019 paper "Multi-pathway Generative Adversarial Hashing for Unsupervised Cross-modal Retrieval" ☆12 · Jun 17, 2019 · Updated 7 years ago
- [ICCV2021] Official Pytorch implementation for SDGZSL (Semantics Disentangling for Generalized Zero-Shot Learning) ☆40 · Dec 16, 2021 · Updated 4 years ago
- Cross-Self KV Cache Pruning for Efficient Vision-Language Inference ☆10 · Dec 15, 2024 · Updated last year
- ☆24 · Jul 1, 2025 · Updated 11 months ago
- implementation of BiGAN model using tensorflow ☆16 · Oct 2, 2018 · Updated 7 years ago
- [ICML 2022] Generalizing to Evolving Domains with Latent Structure-Aware Sequential Autoencoder ☆23 · Feb 25, 2024 · Updated 2 years ago
- [ICCV 2023] Simple Baselines for Interactive Video Retrieval with Questions and Answers ☆20 · Apr 16, 2024 · Updated 2 years ago
- This is a self-made dataset designed for drowning detection. ☆34 · Jul 27, 2024 · Updated last year
- This repository contains code for the paper "Fine-Grained Predicates Learning for Scene Graph Generation (CVPR 2022)". ☆26 · Jun 7, 2024 · Updated 2 years ago
- [IEEE TMM'25] Scene-Text Grounding for Text-Based Video Question Answering ☆17 · Feb 16, 2026 · Updated 4 months ago
- [AAAI 2024] GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval ☆20 · May 10, 2024 · Updated 2 years ago
- ☆27 · Jan 4, 2023 · Updated 3 years ago
- Reidentifying people across a multi-camera environment and detecting their poses, all in real-time. ☆23 · Sep 28, 2022 · Updated 3 years ago
- Released code and data for "Frame-Transformer Emotion Classification Network." ICMR 2017 ☆17 · Jun 17, 2017 · Updated 9 years ago
- ✨ Official code for our paper: "Uncertainty-o: One Model-agnostic Framework for Unveiling Epistemic Uncertainty in Large Multimodal Model… ☆21 · Mar 13, 2025 · Updated last year
- Source code of the paper Dual Learning with Dynamic Knowledge Distillation and Soft Alignment for Partially Relevant Video Retrieval ☆19 · May 13, 2026 · Updated last month