AdrianHsu / S2VT-seq2seq-video-captioning-attention
S2VT (seq2seq) video captioning with bahdanau & luong attention implementation in Tensorflow
☆19Updated 6 years ago
Alternatives and similar repositories for S2VT-seq2seq-video-captioning-attention:
Users that are interested in S2VT-seq2seq-video-captioning-attention are comparing it to the libraries listed below
- Pytorch implementation of audio-visual fusion video captioning model☆27Updated 6 years ago
- [ACM MM 2017 & IEEE TMM 2020] This is the Theano code for the paper "Video Description with Spatial Temporal Attention"☆57Updated 4 years ago
- Implementation for "Joint Event Detection and Description in Continuous Video Streams"☆23Updated 4 years ago
- ☆33Updated 6 years ago
- This repository contains the code for a video captioning system inspired by Sequence to Sequence -- Video to Text. This system takes as i…☆166Updated 5 years ago
- TensorFlow Implementation of "Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification".☆40Updated 6 years ago
- 🎬 Video Captioning: ICCV '15 paper implementation☆47Updated 6 years ago
- Official Tensorflow Implementation of the paper "Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning" in CVPR 2…☆149Updated 5 years ago
- A Pytorch implementation of "describing videos by exploiting temporal structure", ICCV 2015☆48Updated 2 years ago
- PyTorch implementation of video captioning☆13Updated 7 years ago
- Pytorch Implementation of Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning☆107Updated 7 years ago
- Evaluation code for Dense-Captioning Events in Videos☆123Updated 5 years ago
- Pytorch implementation of 'See, Hear, and Read: Deep Aligned Representations'☆33Updated 6 years ago
- PyTorch implementations for "Generating Visual Explanations" (GVE) and "Long-term Recurrent Convolutional Networks" (LRCN)☆92Updated 2 years ago
- A video captioning tool using S2VT method and attention mechanism (TensorFlow)☆15Updated 6 years ago
- converting the pretrained tensorflow SoundNet model to pytorch☆13Updated 2 years ago
- Word2VisualVec : Predicting Visual Features from Text for Image and Video Caption Retrieval☆69Updated 5 years ago
- some models for video caption implemented by pytorch. (S2VT)☆23Updated 7 years ago
- Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval☆67Updated 4 years ago
- pytorch implementation of video captioning☆400Updated 5 years ago
- Using Semantic Compositional Networks for Video Captioning☆96Updated 6 years ago
- The implementation of Sequential VLAD in Pytorch☆19Updated 5 years ago
- Mixture-of-Embeddings-Experts☆119Updated 4 years ago
- TensorFlow implementation for video classification.☆41Updated 6 years ago
- ☆13Updated 6 years ago
- video captioning using 3DCNN and LSTM (pytorch)☆10Updated 5 years ago
- Heterogeneous Memory Enhanced Multimodal Attention Model for VideoQA☆54Updated 3 years ago
- Extract video feature from C3D pretrained on Sports-1M and Kinetics☆15Updated 5 years ago
- Code for the paper: Audio-Visual Model Distillation Using Acoustic Images☆20Updated 2 years ago
- Supplementary material to "Top-down Visual Saliency Guided by Captions" (CVPR 2017)☆107Updated 7 years ago