Word2VisualVec: Predicting Visual Features from Text for Image and Video Caption Retrieval
☆70 · Jan 27, 2020 · Updated 6 years ago
Alternatives and similar repositories for w2vv
Users that are interested in w2vv are comparing it to the libraries listed below.
- Ad-hoc Video Search ☆28 · Feb 18, 2021 · Updated 5 years ago
- Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval ☆68 · Apr 10, 2020 · Updated 5 years ago
- Deep Learning for Video Retrieval by Natural Language ☆11 · Oct 20, 2019 · Updated 6 years ago
- [CVPR2019] Dual Encoding for Zero-Example Video Retrieval ☆153 · Jan 10, 2023 · Updated 3 years ago
- PyTorch Implementation of Consensus-based Sequence Training for Video Captioning ☆60 · May 15, 2018 · Updated 7 years ago
- ☆10 · Apr 20, 2018 · Updated 7 years ago
- Code and Models for paper "Reinforced Video Captioning with Entailment Rewards (EMNLP 2017)" ☆44 · Nov 19, 2019 · Updated 6 years ago
- ☆20 · Sep 19, 2019 · Updated 6 years ago
- ☆16 · Dec 17, 2018 · Updated 7 years ago
- Using Semantic Compositional Networks for Video Captioning ☆96 · Nov 27, 2018 · Updated 7 years ago
- Unsupervised specificity-guided optimization of Image Captioning models to encourage meaningful diversity in the generated captions. Code… ☆13 · May 25, 2025 · Updated 9 months ago
- Source code of our TPAMI'21 paper Dual Encoding for Video Retrieval by Text and CVPR'19 paper Dual Encoding for Zero-Example Video Retrie… ☆88 · Jan 10, 2023 · Updated 3 years ago
- Feature Re-Learning with Data Augmentation for Video Relevance Prediction ☆20 · Jan 10, 2023 · Updated 3 years ago
- Rethinking the Form of Latent States in Image Captioning ☆20 · Aug 31, 2018 · Updated 7 years ago
- Video Grounding and Captioning ☆332 · Oct 12, 2021 · Updated 4 years ago
- pytorch implementation of video captioning ☆400 · Aug 19, 2019 · Updated 6 years ago
- Zero-shot image tagging by hierarchical semantic embedding ☆76 · Dec 23, 2017 · Updated 8 years ago
- This repository contains the code for a video captioning system inspired by Sequence to Sequence -- Video to Text. This system takes as i… ☆172 · Oct 12, 2019 · Updated 6 years ago
- Implementation of "Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning" (https://arxiv.… ☆26 · Nov 3, 2018 · Updated 7 years ago
- The source code of the paper: "To Find Where You Talk: Temporal Sentence Localization in Video with Attention Based Location Regression" ☆30 · Jan 8, 2019 · Updated 7 years ago
- some models for video caption implemented by pytorch. (S2VT) ☆23 · Feb 1, 2018 · Updated 8 years ago
- ☆32 · Jun 22, 2022 · Updated 3 years ago
- with reinforcement learning