The paper of "Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning" accepted in International Joint Conference on Artificial Intelligence (IJCAI) 2017
☆16Jun 29, 2017Updated 9 years ago
Alternatives and similar repositories for hLSTMat
Users that are interested in hLSTMat are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Extension of hLSTMat☆19Apr 15, 2021Updated 5 years ago
- PyTorch Implementation of Consensus-based Sequence Training for Video Captioning☆60May 15, 2018Updated 8 years ago
- Code and Models for paper "Reinforced Video Captioning with Entailment Rewards (EMNLP 2017)"☆44Nov 19, 2019Updated 6 years ago
- ICCV2019: Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network☆68Nov 19, 2019Updated 6 years ago
- implement video caption based on openNMT☆36Apr 19, 2018Updated 8 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- IJCAI2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning☆79Nov 23, 2020Updated 5 years ago
- implementation of TDConvED for video captioning☆13Mar 18, 2020Updated 6 years ago
- Codes for paper of "Attention-based LSTM with Semantic Consistency for Videos Captioning "☆18Mar 22, 2017Updated 9 years ago
- ☆14Jan 30, 2017Updated 9 years ago
- A video captioning tool using S2VT method and attention mechanism (TensorFlow)☆15Oct 14, 2018Updated 7 years ago
- [ACM MM 2017 & IEEE TMM 2020] This is the Theano code for the paper "Video Description with Spatial Temporal Attention"☆61Oct 20, 2020Updated 5 years ago
- video captioning using 3DCNN and LSTM (pytorch)☆11Sep 26, 2019Updated 6 years ago
- ☆20Sep 19, 2019Updated 6 years ago
- Attentive Semantic Video Generation using Captions☆36Oct 22, 2017Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Study of frame rate effects on MSR-VTT dataset☆14Feb 10, 2018Updated 8 years ago
- A curated list of research papers in Video Captioning☆121Jan 5, 2021Updated 5 years ago
- ☆23Apr 12, 2022Updated 4 years ago
- Extract video feature from C3D pretrained on Sports-1M and Kinetics☆16Jul 2, 2019Updated 7 years ago
- A Pytorch implementation of "Reconstruction Network for Video Captioning", CVPR 2018☆53Apr 6, 2020Updated 6 years ago
- Soft attention mechanism for video caption generation☆154Jul 17, 2017Updated 8 years ago
- Pytorch Implementation of Videos as Space-Time Region Graphs☆27Jun 10, 2026Updated 3 weeks ago
- ☆10Dec 28, 2018Updated 7 years ago
- Supplementary material to "Top-down Visual Saliency Guided by Captions" (CVPR 2017)☆107Jan 22, 2018Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Source code for Delving Deeper into the Decoder for Video Captioning☆39Jun 1, 2021Updated 5 years ago
- the source code of Multi-modal Circulant Fusion (MCF) for Temporal Activity Localization☆24Mar 10, 2019Updated 7 years ago
- Extension of Self-Supervised Temporal Hashing☆15Apr 15, 2021Updated 5 years ago
- ☆31Jun 2, 2018Updated 8 years ago
- PyTorch implementation of L-GCN [https://arxiv.org/abs/2008.09105]☆25Apr 25, 2021Updated 5 years ago
- Caffe implementation of paper: "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering"☆29Oct 24, 2018Updated 7 years ago
- ☆19Nov 25, 2022Updated 3 years ago
- Pytorch implementation of audio-visual fusion video captioning model☆27Jul 26, 2018Updated 7 years ago
- PyTorch code for the Findings of EMNLP 2021 paper "Does Vision-and-Language Pretraining Improve Lexical Grounding?"☆11Sep 26, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Tensorflow implement of paper: Sequence to Sequence: Video to Text☆88Jul 31, 2018Updated 7 years ago
- source code for Finding Action Tubes, CVPR 2015☆64Jun 22, 2016Updated 10 years ago
- Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"☆23Mar 4, 2020Updated 6 years ago
- with reinforcement learning☆32May 19, 2020Updated 6 years ago
- Project Uncovering Temporal Context for Video Question and Answering☆14Apr 12, 2016Updated 10 years ago
- All files, presentations and documents used in workshops, meetups and seminars☆14Mar 26, 2020Updated 6 years ago
- Large-Vocabulary Continuous Sign Language Recognition, 2024☆16May 30, 2024Updated 2 years ago