MDMMT: Multidomain Multimodal Transformer for Video Retrieval
☆26Jun 28, 2021Updated 4 years ago
Alternatives and similar repositories for mdmmt
Users that are interested in mdmmt are comparing it to the libraries listed below.
- Use CLIP to represent video for Retrieval Task☆70Mar 1, 2021Updated 5 years ago
- Code and benchmarks for the Semantic Video Retrieval Task☆53Oct 18, 2022Updated 3 years ago
- Source code of our TPAMI'21 paper Dual Encoding for Video Retrieval by Text and CVPR'19 paper Dual Encoding for Zero-Example Video Retrie…☆88Jan 10, 2023Updated 3 years ago
- ☆35Mar 22, 2019Updated 7 years ago
- Code accompanying the paper "Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning".☆211Jun 12, 2020Updated 5 years ago
- Code for the paper "Controllable Video Captioning with an Exemplar Sentence"☆12Apr 14, 2021Updated 4 years ago
- [CVPR2019] Dual Encoding for Zero-Example Video Retrieval☆153Jan 10, 2023Updated 3 years ago
- ☆62May 11, 2021Updated 4 years ago
- Weakly Supervised Video Moment Retrieval from Text Queries☆43Jul 20, 2020Updated 5 years ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".☆18Sep 17, 2021Updated 4 years ago
- Align and Prompt: Video-and-Language Pre-training with Entity Prompts☆188May 1, 2025Updated 10 months ago
- Ad-hoc Video Search☆28Feb 18, 2021Updated 5 years ago
- ☆102Sep 27, 2021Updated 4 years ago
- ☆259Dec 10, 2022Updated 3 years ago
- A PyTorch implementation of TVC☆24Dec 18, 2023Updated 2 years ago
- ☆10Jan 3, 2023Updated 3 years ago
- Codebase for "Reducing Representation Drift in Online Continual Learning"☆14Jun 8, 2021Updated 4 years ago
- Dataset for Bilingual VLN☆11Dec 5, 2020Updated 5 years ago
- COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning☆291Sep 6, 2022Updated 3 years ago
- python codes for CIDEr - Consensus-based Image Caption Evaluation☆32Jun 25, 2019Updated 6 years ago
- ☆42Apr 25, 2021Updated 4 years ago
- Deep Learning for Video Retrieval by Natural Language☆11Oct 20, 2019Updated 6 years ago
- Video embeddings for retrieval with natural language queries☆342Feb 15, 2023Updated 3 years ago
- [arXiv22] Disentangled Representation Learning for Text-Video Retrieval☆98Apr 7, 2022Updated 3 years ago
- Code for the HowTo100M paper☆298Mar 10, 2020Updated 6 years ago
- This repo contains all the codes for SEScore implementation☆15Mar 3, 2025Updated last year
- A light-weight data management system for large-scale pretraining☆21May 17, 2025Updated 10 months ago
- Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]☆380May 19, 2022Updated 3 years ago
- Multi-Modal Transformer for Video Retrieval☆265Oct 9, 2024Updated last year
- ☆23Aug 21, 2021Updated 4 years ago
- CVPR 2021 Official Pytorch Code for UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training☆34Nov 9, 2021Updated 4 years ago
- Pipeline to scrape prompt + image url pairs from LAION `share-dalle-3` discord channel☆11Oct 10, 2023Updated 2 years ago
- Extension of Self-Supervised Temporal Hashing☆14Apr 15, 2021Updated 4 years ago
- [NeurIPS 2021] Moment-DETR code and QVHighlights dataset☆343Mar 9, 2026Updated 2 weeks ago
- [CVPR 2022] Cross-Architecture Self-supervised Video Representation Learning☆24Jul 5, 2022Updated 3 years ago
- Video Retrieval, 3D ResNet, Triplet Loss, UCF101, Pytorch☆23Sep 8, 2019Updated 6 years ago
- Phrase Localization Evaluation Toolkit☆20Aug 16, 2019Updated 6 years ago
- Published in CVPR 2020; matlab codes☆22Sep 15, 2024Updated last year
- Source code of the paper titled *Improving Video Captioning with Temporal Composition of a Visual-Syntactic Embedding*☆30Apr 16, 2021Updated 4 years ago