fartashf/vsepp

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/fartashf/vsepp)

fartashf / vsepp

PyTorch Code for the paper "VSE++: Improving Visual-Semantic Embeddings with Hard Negatives"

☆523

Alternatives and similar repositories for vsepp

Users that are interested in vsepp are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kuanghuei / SCAN
View on GitHub
PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)
☆579May 18, 2023Updated 3 years ago
KunpengLi1994 / VSRN
View on GitHub
PyTorch code for ICCV'19 paper "Visual Semantic Reasoning for Image-Text Matching"
☆304Jan 14, 2020Updated 6 years ago
yalesong / pvse
View on GitHub
Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval (CVPR 2019)
☆135Mar 15, 2024Updated 2 years ago
ExplorerFreda / VSE-C
View on GitHub
[COLING 2018] Learning Visually-Grounded Semantics from Contrastive Adversarial Samples.
☆58Sep 12, 2019Updated 6 years ago
CrossmodalGroup / GSMN
View on GitHub
Implementation of our CVPR2020 paper, Graph Structured Network for Image-Text Matching
☆170Oct 12, 2020Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ryankiros / visual-semantic-embedding
View on GitHub
Implementation of the image-sentence embedding method described in "Unifying Visual-Semantic Embeddings with Multimodal Neural Language M…
☆427Feb 9, 2017Updated 9 years ago
woodfrog / vse_infty
View on GitHub
Code for "Learning the Best Pooling Strategy for Visual Semantic Embedding", CVPR 2021 (Oral)
☆165Aug 24, 2025Updated 10 months ago
HuiChen24 / IMRAM
View on GitHub
code for our CVPR2020 paper "IMRAM: Iterative Matching with Recurrent Attention Memory for Cross-Modal Image-Text Retrieval"
☆95Mar 8, 2020Updated 6 years ago
peteanderson80 / bottom-up-attention
View on GitHub
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
☆1,471Feb 3, 2023Updated 3 years ago
sunnychencool / AOQ
View on GitHub
Adaptive Offline Quintuplet Loss for Image-Text Matching (AOQ)
☆34Jul 2, 2020Updated 6 years ago
hardyqr / HAL
View on GitHub
[AAAI'20] Code release for "HAL: Improved Text-Image Matching by Mitigating Visual Semantic Hubs".
☆38Oct 4, 2023Updated 2 years ago
yiling2018 / saem
View on GitHub
Learning Fragment Self-Attention Embeddings for Image-Text Matching, in ACM MM 2019
☆41Sep 24, 2019Updated 6 years ago
cshizhe / hgr_v2t
View on GitHub
Code accompanying the paper "Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning".
☆211Jun 12, 2020Updated 6 years ago
ruotianluo / DiscCaptioning
View on GitHub
Code for Discriminability objective for training descriptive captions(CVPR 2018)
☆109Nov 21, 2019Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ZihaoWang-CV / CAMP_iccv19
View on GitHub
CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval
☆127Feb 26, 2020Updated 6 years ago
BruceW91 / CVSE
View on GitHub
The official source code for the paper Consensus-Aware Visual-Semantic Embedding for Image-Text Matching (ECCV 2020)
☆168Feb 7, 2022Updated 4 years ago
mesnico / TERAN
View on GitHub
Code and Resources for the Transformer Encoder Reasoning and Alignment Network (TERAN), accepted for publication in ACM Transactions on M…
☆74Dec 6, 2023Updated 2 years ago
niluthpol / multimodal_vtt
View on GitHub
Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval
☆68Apr 10, 2020Updated 6 years ago
LgQu / CAMERA
View on GitHub
Context-Aware Multi-View Summarization Network for Image-Text Matching. ACM MM'20
☆29May 26, 2022Updated 4 years ago
kywen1119 / DSRAN
View on GitHub
Code for journal paper "Learning Dual Semantic Relations with Graph Attention for Image-Text Matching", TCSVT, 2020.
☆74Oct 25, 2022Updated 3 years ago
Paranioar / Awesome_Matching_Pretraining_Transfering
View on GitHub
The Paper List of Large Multi-Modality Model (Perception, Generation, Unification), Parameter-Efficient Finetuning, Vision-Language Pretr…
☆446Sep 25, 2025Updated 9 months ago
ruotianluo / self-critical.pytorch
View on GitHub
Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others.
☆1,003Oct 5, 2023Updated 2 years ago
linxd5 / VSE_Pytorch
View on GitHub
Pytorch implementation of the image-sentence embedding method described in "Unifying Visual-Semantic Embeddings with Multimodal Neural La…
☆87Jul 17, 2017Updated 9 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
danieljf24 / dual_encoding
View on GitHub
[CVPR2019] Dual Encoding for Zero-Example Video Retrieval
☆153Jan 10, 2023Updated 3 years ago
lwwang / Two_branch_network
View on GitHub
☆83Dec 2, 2020Updated 5 years ago
Paranioar / SGRAF
View on GitHub
[AAAI2021] The code of “Similarity Reasoning and Filtration for Image-Text Matching”
☆220Apr 11, 2024Updated 2 years ago
ivendrov / order-embedding
View on GitHub
Implementation of caption-image retrieval from the paper "Order-Embeddings of Images and Language"
☆189Oct 13, 2016Updated 9 years ago
airsplay / lxmert
View on GitHub
PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers".
☆967Oct 22, 2022Updated 3 years ago
ChenRocks / UNITER
View on GitHub
Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"
☆800Jun 30, 2021Updated 5 years ago
CrossmodalGroup / NAAF
View on GitHub
Implementation of our CVPR2022 paper, Negative-Aware Attention Framework for Image-Text Matching.
☆119Jun 19, 2023Updated 3 years ago
sunpeng981712364 / ACMR_demo
View on GitHub
☆93Oct 20, 2017Updated 8 years ago
li-xirong / avs
View on GitHub
Ad-hoc Video Search
☆29Feb 18, 2021Updated 5 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
albanie / collaborative-experts
View on GitHub
Video embeddings for retrieval with natural language queries
☆344Feb 15, 2023Updated 3 years ago
LisaAnne / DCC
View on GitHub
Implementation of CVPR 2016 paper
☆74Jan 31, 2021Updated 5 years ago
Shiyang-Yan / Discrete-continous-PG-for-Retrieval
View on GitHub
☆13Feb 1, 2022Updated 4 years ago
aviveise / 2WayNet
View on GitHub
☆15Sep 19, 2017Updated 8 years ago
jiasenlu / vilbert_beta
View on GitHub
☆478Nov 21, 2022Updated 3 years ago
jwehrmann / lavse
View on GitHub
Language-Agnostic Visual-Semantic Embeddings (ICCV'19)
☆22Nov 11, 2019Updated 6 years ago
iLearn-Lab / SIGIR21-DIME
View on GitHub
Dynamic Modality Interaction Modeling for Image-Text Retrieval. SIGIR'21
☆70Apr 5, 2026Updated 3 months ago