Word2VisualVec : Predicting Visual Features from Text for Image and Video Caption Retrieval
☆70Jan 27, 2020Updated 6 years ago
Alternatives and similar repositories for w2vv
Users that are interested in w2vv are comparing it to the libraries listed below.
- Ad-hoc Video Search☆28Feb 18, 2021Updated 5 years ago
- Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval☆68Apr 10, 2020Updated 6 years ago
- Deep Learning for Video Retrieval by Natural Language☆11Oct 20, 2019Updated 6 years ago
- [CVPR2019] Dual Encoding for Zero-Example Video Retrieval☆153Jan 10, 2023Updated 3 years ago
- PyTorch Implementation of Consensus-based Sequence Training for Video Captioning☆60May 15, 2018Updated 7 years ago
- ☆10Apr 20, 2018Updated 8 years ago
- Code and Models for paper "Reinforced Video Captioning with Entailment Rewards (EMNLP 2017)"☆44Nov 19, 2019Updated 6 years ago
- ☆20Sep 19, 2019Updated 6 years ago
- ☆16Dec 17, 2018Updated 7 years ago
- Using Semantic Compositional Networks for Video Captioning☆96Nov 27, 2018Updated 7 years ago
- Unsupervised specificity-guided optimization of Image Captioning models to encourage meaningful diversity in the generated captions. Code…☆13May 25, 2025Updated 11 months ago
- Source code of our TPAMI'21 paper Dual Encoding for Video Retrieval by Text and CVPR'19 paper Dual Encoding for Zero-Example Video Retrie…☆88Jan 10, 2023Updated 3 years ago
- Feature Re-Learning with Data Augmentation for Video Relevance Prediction☆20Jan 10, 2023Updated 3 years ago
- Rethinking the Form of Latent States in Image Captioning☆20Aug 31, 2018Updated 7 years ago
- Video Grounding and Captioning☆332Oct 12, 2021Updated 4 years ago
- pytorch implementation of video captioning☆400Aug 19, 2019Updated 6 years ago
- Zero-shot image tagging by hierarchical semantic embedding☆76Dec 23, 2017Updated 8 years ago
- This repository contains the code for a video captioning system inspired by Sequence to Sequence -- Video to Text. This system takes as i…☆171Oct 12, 2019Updated 6 years ago
- Implementation of "Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning" (https://arxiv.…☆26Nov 3, 2018Updated 7 years ago
- The source code of the paper: "To Find Where You Talk: Temporal Sentence Localization in Video with Attention Based Location Regression"☆30Jan 8, 2019Updated 7 years ago
- some models for video caption implemented by pytorch. (S2VT)☆23Feb 1, 2018Updated 8 years ago
- ☆32Jun 22, 2022Updated 3 years ago
- with reinforcement learning☆32May 19, 2020Updated 5 years ago
- W2VV++: A fully deep learning solution for ad-hoc video search☆29Jul 25, 2024Updated last year
- ☆30Oct 2, 2018Updated 7 years ago
- ☆19May 2, 2020Updated 6 years ago
- Stack-Captioning: Coarse-to-Fine Learning for Image Captioning☆63Apr 18, 2018Updated 8 years ago
- Re-implementation of the work Livebot☆16Jun 21, 2020Updated 5 years ago
- Official Tensorflow Implementation of the paper "Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning" in CVPR 2…☆152Jul 8, 2019Updated 6 years ago
- Evaluation code for Dense-Captioning Events in Videos☆130Jun 11, 2019Updated 6 years ago
- Adversarial Inference for Multi-Sentence Video Descriptions (CVPR 2019)☆34Jul 17, 2019Updated 6 years ago
- Extension of hLSTMat☆19Apr 15, 2021Updated 5 years ago
- Source code for the paper "Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training"☆66Apr 18, 2019Updated 7 years ago
- image caption with semantic attention☆11Apr 1, 2017Updated 9 years ago
- [ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning☆171Dec 4, 2020Updated 5 years ago
- PyTorch implementation of Multiple-instance learning☆117May 13, 2019Updated 6 years ago
- Source code for paper "Towards Automatic Learning of Procedures from Web Instructional Videos"☆34Jan 6, 2019Updated 7 years ago
- Code for detecting visual concepts in images.☆150Feb 27, 2018Updated 8 years ago
- ICCV2019: Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network☆68Nov 19, 2019Updated 6 years ago