Word2VisualVec : Predicting Visual Features from Text for Image and Video Caption Retrieval
☆70Jan 27, 2020Updated 6 years ago
Alternatives and similar repositories for w2vv
Users that are interested in w2vv are comparing it to the libraries listed below
Sorting:
- Ad-hoc Video Search☆28Feb 18, 2021Updated 5 years ago
- Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval☆68Apr 10, 2020Updated 5 years ago
- Deep Learning for Video Retrieval by Natural Language☆11Oct 20, 2019Updated 6 years ago
- [CVPR2019] Dual Encoding for Zero-Example Video Retrieval☆153Jan 10, 2023Updated 3 years ago
- Code and Models for paper "Reinforced Video Captioning with Entailment Rewards (EMNLP 2017)"☆44Nov 19, 2019Updated 6 years ago
- PyTorch Implementation of Consensus-based Sequence Training for Video Captioning☆60May 15, 2018Updated 7 years ago
- ☆10Apr 20, 2018Updated 7 years ago
- ☆20Sep 19, 2019Updated 6 years ago
- Source code of our TPAMI'21 paper Dual Encoding for Video Retrieval by Text and CVPR'19 paper Dual Encoding for Zero-Example Video Retrie…☆88Jan 10, 2023Updated 3 years ago
- W2VV++: A fully deep learning solution for ad-hoc video search☆29Jul 25, 2024Updated last year
- Unsupervised specificity-guided optimization of Image Captioning models to encourage meaningful diversity in the generated captions. Code…☆13May 25, 2025Updated 9 months ago
- with reinforcement learning☆31May 19, 2020Updated 5 years ago
- ☆16Dec 17, 2018Updated 7 years ago
- Using Semantic Compositional Networks for Video Captioning☆96Nov 27, 2018Updated 7 years ago
- Video Grounding and Captioning☆332Oct 12, 2021Updated 4 years ago
- ☆19May 2, 2020Updated 5 years ago
- pytorch implementation of video captioning☆399Aug 19, 2019Updated 6 years ago
- ☆32Jun 22, 2022Updated 3 years ago
- Source code for paper "Towards Automatic Learning of Procedures from Web Instructional Videos"☆34Jan 6, 2019Updated 7 years ago
- Feature Re-Learning with Data Augmentation for Video Relevance Prediction☆20Jan 10, 2023Updated 3 years ago
- This repository contains the code for a video captioning system inspired by Sequence to Sequence -- Video to Text. This system takes as i…☆172Oct 12, 2019Updated 6 years ago
- Adversarial Inference for Multi-Sentence Video Descriptions (CVPR 2019)☆34Jul 17, 2019Updated 6 years ago
- The source code of the paper: "To Find Where You Talk: Temporal Sentence Localization in Video with Attention Based Location Regression"☆30Jan 8, 2019Updated 7 years ago
- Source code for the paper "Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training"☆66Apr 18, 2019Updated 6 years ago
- Authors official Tensorflow implementation of the "Near-Duplicate Video Retrieval with Deep Metric Learning" [ICCVW 2017]☆120Apr 12, 2023Updated 2 years ago
- Stack-Captioning: Coarse-to-Fine Learning for Image Captioning☆63Apr 18, 2018Updated 7 years ago
- some models for video caption implemented by pytorch. (S2VT)☆23Feb 1, 2018Updated 8 years ago
- Rethinking the Form of Latent States in Image Captioning☆20Aug 31, 2018Updated 7 years ago
- [ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning☆171Dec 4, 2020Updated 5 years ago
- Implementation of "Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning" (https://arxiv.…☆26Nov 3, 2018Updated 7 years ago
- Video embeddings for retrieval with natural language queries☆342Feb 15, 2023Updated 3 years ago
- Official Tensorflow Implementation of the paper "Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning" in CVPR 2…☆152Jul 8, 2019Updated 6 years ago
- Multi-index hashing for the resolution of ANN search problem on large datasets☆15Oct 16, 2018Updated 7 years ago
- image caption with semantic attention☆11Apr 1, 2017Updated 8 years ago
- Source code for "Recurrent Fusion Network for Image Captioning".☆23Nov 24, 2018Updated 7 years ago
- Zero-shot image tagging by hierarchical semantic embedding☆76Dec 23, 2017Updated 8 years ago
- Video classification, youtube8m, Knowledge distillation, Tensorflow, NeXtVLAD☆27Sep 5, 2019Updated 6 years ago
- Video-Text Representation Learning via Differentiable Weak Temporal Alignment (PyTorch implementation for the CVPR 2022 paper)☆11Oct 12, 2022Updated 3 years ago
- This repo is used for generating faking labeled positive videos for SVD dataset.☆10Aug 16, 2020Updated 5 years ago