UKPLab/MMT-Retrieval

☆131

Alternatives and similar repositories for MMT-Retrieval

Users who are interested in MMT-Retrieval are comparing it to the libraries listed below.

intersun / LightningDOT
source code and pre-trained/fine-tuned checkpoint for NAACL 2021 paper LightningDOT
☆72 · Nov 14, 2022 · Updated 3 years ago
sunnychencool / AOQ
Adaptive Offline Quintuplet Loss for Image-Text Matching (AOQ)
☆34 · Jul 2, 2020 · Updated 6 years ago
nreimers / beir-sparta
Re-Implementation of SPARTA model
☆13 · Oct 1, 2021 · Updated 4 years ago
google-research-datasets / MultiReQA
We are creating a challenging new benchmark MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models. Retrieval quest…
☆31 · Jul 9, 2020 · Updated 6 years ago
willard-yuan / video-text-retrieval-papers
☆15 · Sep 16, 2021 · Updated 4 years ago
jayleicn / mTVRetrieval
[ACL 2021] mTVR: Multilingual Video Moment Retrieval
☆27 · Aug 20, 2022 · Updated 3 years ago
hardyqr / HAL
[AAAI'20] Code release for "HAL: Improved Text-Image Matching by Mitigating Visual Semantic Hubs".
☆38 · Oct 4, 2023 · Updated 2 years ago
BruceW91 / CVSE
The official source code for the paper Consensus-Aware Visual-Semantic Embedding for Image-Text Matching (ECCV 2020)
☆168 · Feb 7, 2022 · Updated 4 years ago
jayleicn / ClipBERT
[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning…
☆730 · Aug 8, 2023 · Updated 2 years ago
zinengtang / VidLanKD
PyTorch version of VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer (NeurIPS 2021)
☆56 · Feb 6, 2023 · Updated 3 years ago
mwray / Semantic-Video-Retrieval
Code and benchmarks for the Semantic Video Retrieval Task
☆53 · Oct 18, 2022 · Updated 3 years ago
kywen1119 / DSRAN
Code for journal paper "Learning Dual Semantic Relations with Graph Attention for Image-Text Matching", TCSVT, 2020.
☆74 · Oct 25, 2022 · Updated 3 years ago
Paranioar / Awesome_Matching_Pretraining_Transfering
The Paper List of Large Multi-Modality Model (Perception, Generation, Unification), Parameter-Efficient Finetuning, Vision-Language Pretr…
☆446 · Sep 25, 2025 · Updated 9 months ago
CryhanFang / CLIP2Video
☆260 · Dec 10, 2022 · Updated 3 years ago
ChenRocks / UNITER
Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"
☆800 · Jun 30, 2021 · Updated 5 years ago
woodfrog / vse_infty
Code for "Learning the Best Pooling Strategy for Visual Semantic Embedding", CVPR 2021 (Oral)
☆165 · Aug 24, 2025 · Updated 10 months ago
yalesong / pvse
Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval (CVPR 2019)
☆135 · Mar 15, 2024 · Updated 2 years ago
antoine77340 / MIL-NCE_HowTo100M
PyTorch GPU distributed training code for MIL-NCE HowTo100M
☆221 · Jul 5, 2022 · Updated 4 years ago
AndresPMD / semantic_adaptive_margin
WACV 2022 Paper - Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching
☆16 · Dec 10, 2021 · Updated 4 years ago
niluthpol / multimodal_vtt
Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval
☆68 · Apr 10, 2020 · Updated 6 years ago
CrossmodalGroup / BFAN
Implementation of our ACMMM2019 paper, Focus Your Attention: A Bidirectional Focal Attention Network for Image-Text Matching
☆39 · Jun 19, 2023 · Updated 3 years ago
cshizhe / hgr_v2t
Code accompanying the paper "Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning".
☆211 · Jun 12, 2020 · Updated 6 years ago
yuewang-cuhk / awesome-vision-language-pretraining-papers
Recent Advances in Vision and Language PreTrained Models (VL-PTMs)
☆1,159 · Aug 19, 2022 · Updated 3 years ago
j-min / VL-T5
PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)
☆372 · Jul 29, 2023 · Updated 2 years ago
jayleicn / singularity
[ACL 2023] Official PyTorch code for Singularity model in "Revealing Single Frame Bias for Video-and-Language Learning"
☆136 · May 5, 2023 · Updated 3 years ago
easonnie / mlp-vil
MLPs for Vision and Language Modeling (Coming Soon)
☆27 · Dec 9, 2021 · Updated 4 years ago
studio-ousia / bpr
Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering
☆175 · Jun 6, 2021 · Updated 5 years ago
microsoft / M3P
Multitask Multilingual Multimodal Pre-training
☆72 · Nov 27, 2022 · Updated 3 years ago
google-research-datasets / wit
WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique imag…
☆1,113 · Sep 27, 2024 · Updated last year
zhegan27 / VILLA
Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": UNITER…
☆119 · Jan 13, 2021 · Updated 5 years ago
xuewyang / Fashion_Captioning
ECCV2020 paper: Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards. Code and Data.
☆85 · Jun 22, 2023 · Updated 3 years ago
pzzhang / VinVL
Project page for VinVL
☆360 · Jul 26, 2023 · Updated 2 years ago
facebookresearch / vilbert-multi-task
Multi Task Vision and Language
☆824 · Feb 16, 2022 · Updated 4 years ago
uta-smile / TCL
Code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022
☆271 · Oct 2, 2024 · Updated last year
microsoft / Oscar
Oscar and VinVL
☆1,054 · Aug 28, 2023 · Updated 2 years ago
gabeur / mmt
Multi-Modal Transformer for Video Retrieval
☆265 · Oct 9, 2024 · Updated last year
naver-ai / pcme
Official PyTorch implementation of "Probabilistic Cross-Modal Embedding" (CVPR 2021)
☆141 · Mar 1, 2024 · Updated 2 years ago
airsplay / vokenization
PyTorch code for EMNLP 2020 Paper "Vokenization: Improving Language Understanding with Visual Supervision"
☆191 · Mar 8, 2021 · Updated 5 years ago
ZihaoWang-CV / CAMP_iccv19
CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval
☆127 · Feb 26, 2020 · Updated 6 years ago