MDMMT: Multidomain Multimodal Transformer for Video Retrieval
☆26Jun 28, 2021Updated 4 years ago
Alternatives and similar repositories for mdmmt
Users that are interested in mdmmt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Mutual Modality Learning code☆15Mar 1, 2021Updated 5 years ago
- Use CLIP to represent video for Retrieval Task☆70Mar 1, 2021Updated 5 years ago
- Code and benchmarks for the Semantic Video Retrieval Task☆53Oct 18, 2022Updated 3 years ago
- ☆31Jun 22, 2022Updated 3 years ago
- Source code of our TPAMI'21 paper Dual Encoding for Video Retrieval by Text and CVPR'19 paper Dual Encoding for Zero-Example Video Retrie…☆88Jan 10, 2023Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆35Mar 22, 2019Updated 7 years ago
- Code accompanying the paper "Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning".☆211Jun 12, 2020Updated 6 years ago
- Code for the paper "Controllable Video Captioning with an Exemplar Sentence"☆12Apr 14, 2021Updated 5 years ago
- [CVPR2019] Dual Encoding for Zero-Example Video Retrieval☆153Jan 10, 2023Updated 3 years ago
- ☆62May 11, 2021Updated 5 years ago
- Weakly Supervised Video Moment Retrieval from Text Queries☆43Jul 20, 2020Updated 5 years ago
- A Simple Framwork for CV Pre-training Model (SOCO, VirTex, BEiT)☆15Oct 18, 2021Updated 4 years ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".☆18Sep 17, 2021Updated 4 years ago
- Align and Prompt: Video-and-Language Pre-training with Entity Prompts☆188May 1, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- PyTorch GPU distributed training code for MIL-NCE HowTo100M☆220Jul 5, 2022Updated 3 years ago
- ☆101Sep 27, 2021Updated 4 years ago
- ☆260Dec 10, 2022Updated 3 years ago
- A PyTorch implementation of TVC☆24Dec 18, 2023Updated 2 years ago
- IJCAI2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning☆79Nov 23, 2020Updated 5 years ago
- [ECCV 2020] PyTorch code for XML on TVRetrieval dataset - TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval☆163May 28, 2024Updated 2 years ago
- ☆10Jan 3, 2023Updated 3 years ago
- [CVPR2022 Oral] The official code for "TransRank: Self-supervised Video Representation Learning via Ranking-based Transformation Recognit…☆18Aug 1, 2022Updated 3 years ago
- [ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning☆170Dec 4, 2020Updated 5 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Codebase for " Reducing Representation Drift in Online Continual Learning"☆14Jun 8, 2021Updated 5 years ago
- COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning☆291Sep 6, 2022Updated 3 years ago
- python codes for CIDEr - Consensus-based Image Caption Evaluation☆32Jun 25, 2019Updated 6 years ago
- ☆42Apr 25, 2021Updated 5 years ago
- Deep Learning for Video Retrieval by Natural Language☆11Oct 20, 2019Updated 6 years ago
- Video embeddings for retrieval with natural language queries☆344Feb 15, 2023Updated 3 years ago
- [arXiv22] Disentangled Representation Learning for Text-Video Retrieval☆97Apr 7, 2022Updated 4 years ago
- Code for the HowTo100M paper☆303Mar 10, 2020Updated 6 years ago
- ☆15Mar 20, 2020Updated 6 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- This repo contains all the codes for SEScore implementation☆15Mar 3, 2025Updated last year
- ☆14Jun 22, 2022Updated 3 years ago
- A light-weight data management system for large-scale pretraining☆21May 17, 2025Updated last year
- Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]☆376May 19, 2022Updated 4 years ago
- pytorch implementation of Semantics-AssistedVideoCaptioning☆11Feb 16, 2023Updated 3 years ago
- Multi-Modal Transformer for Video Retrieval☆265Oct 9, 2024Updated last year
- ☆23Aug 21, 2021Updated 4 years ago