☆33Apr 20, 2018Updated 7 years ago
Alternatives and similar repositories for video-caption.pytorch
Users that are interested in video-caption.pytorch are comparing it to the libraries listed below
Sorting:
- implement video caption based on openNMT☆36Apr 19, 2018Updated 7 years ago
- video captioning using 3DCNN and LSTM (pytorch)☆11Sep 26, 2019Updated 6 years ago
- Code and Models for paper "Reinforced Video Captioning with Entailment Rewards (EMNLP 2017)"☆44Nov 19, 2019Updated 6 years ago
- pytorch implementation of video captioning☆399Aug 19, 2019Updated 6 years ago
- ☆16Dec 17, 2018Updated 7 years ago
- [ACM MM 2017 & IEEE TMM 2020] This is the Theano code for the paper "Video Description with Spatial Temporal Attention"☆61Oct 20, 2020Updated 5 years ago
- Extension of hLSTMat☆19Apr 15, 2021Updated 4 years ago
- The paper of "Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning" accepted in International Joint Conference on Arti…☆16Jun 29, 2017Updated 8 years ago
- Codes for paper of "Attention-based LSTM with Semantic Consistency for Videos Captioning "☆18Mar 22, 2017Updated 8 years ago
- some models for video caption implemented by pytorch. (S2VT)☆23Feb 1, 2018Updated 8 years ago
- Implementation of "Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning" (https://arxiv.…☆26Nov 3, 2018Updated 7 years ago
- implementation of TDConvED for video captioning☆13Mar 18, 2020Updated 5 years ago
- PyTorch Implementation of Consensus-based Sequence Training for Video Captioning☆60May 15, 2018Updated 7 years ago
- Extract video feature from C3D pretrained on Sports-1M and Kinetics☆15Jul 2, 2019Updated 6 years ago
- ICCV2019: Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network☆68Nov 19, 2019Updated 6 years ago
- Official Tensorflow Implementation of the paper "Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning" in CVPR 2…☆152Jul 8, 2019Updated 6 years ago
- IJCAI2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning☆79Nov 23, 2020Updated 5 years ago
- Poet: Product-oriented Video Captioner for E-commerce☆12Sep 21, 2020Updated 5 years ago
- Video to Language Challenge (MSR-VTT Challenge 2016)☆32Dec 28, 2017Updated 8 years ago
- Using Semantic Compositional Networks for Video Captioning☆96Nov 27, 2018Updated 7 years ago
- Prototypical Contrast and Reverse Prediction: Unsupervised Skeleton based Action Recognition☆11Aug 30, 2021Updated 4 years ago
- ☆14Jan 30, 2017Updated 9 years ago
- Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"☆23Mar 4, 2020Updated 5 years ago
- Video captioning on MSR-VTT Dataset☆12Mar 21, 2021Updated 4 years ago
- Implementation for "Multilevel Language and Vision Integration for Text-to-Clip Retrieval"☆49Jan 21, 2019Updated 7 years ago
- Pytorch implementation of audio-visual fusion video captioning model☆27Jul 26, 2018Updated 7 years ago
- Evaluation code for Dense-Captioning Events in Videos☆130Jun 11, 2019Updated 6 years ago
- Soft attention mechanism for video caption generation☆154Jul 17, 2017Updated 8 years ago
- Video Grounding and Captioning☆332Oct 12, 2021Updated 4 years ago
- Source code for Semantics-Assisted Video Captioning Model Trained with Scheduled Sampling Strategy☆55Jul 31, 2021Updated 4 years ago
- Connective Cognition Network for Directional Visual Commonsense Reasoning☆15May 6, 2021Updated 4 years ago
- Weakly Supervised Dense Event Captioning in Videos, i.e. generating multiple sentence descriptions for a video in a weakly-supervised man…☆104Mar 21, 2020Updated 5 years ago
- Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding☆33Aug 29, 2019Updated 6 years ago
- Adversarial Inference for Multi-Sentence Video Descriptions (CVPR 2019)☆34Jul 17, 2019Updated 6 years ago
- automatic video description generation with GPU training☆256Jan 12, 2020Updated 6 years ago
- Study of frame rate effects on MSR-VTT dataset☆14Feb 10, 2018Updated 8 years ago
- Code for ECCV 2020 paper "Hierarchical Visual-Textual Graph for Temporal Activity Localization via Language"☆17Aug 25, 2020Updated 5 years ago
- A video captioning tool using S2VT method and attention mechanism (TensorFlow)☆15Oct 14, 2018Updated 7 years ago
- ☆22Jan 14, 2026Updated last month