yaoli / arctic-capgen-vidView external linksLinks
automatic video description generation with GPU training
☆257Jan 12, 2020Updated 6 years ago
Alternatives and similar repositories for arctic-capgen-vid
Users that are interested in arctic-capgen-vid are comparing it to the libraries listed below
Sorting:
- Caffe☆205Oct 9, 2017Updated 8 years ago
- Soft attention mechanism for video caption generation☆154Jul 17, 2017Updated 8 years ago
- Implementation of "Sequence to Sequence – Video to Text"☆266Apr 8, 2017Updated 8 years ago
- Using Semantic Compositional Networks for Video Captioning☆96Nov 27, 2018Updated 7 years ago
- Sentence/Caption evaluation using automated metrics☆61May 5, 2016Updated 9 years ago
- [ACM MM 2017 & IEEE TMM 2020] This is the Theano code for the paper "Video Description with Spatial Temporal Attention"☆61Oct 20, 2020Updated 5 years ago
- Implementation of caption-image retrieval from the paper "Order-Embeddings of Images and Language"☆189Oct 13, 2016Updated 9 years ago
- A Pytorch implementation of "describing videos by exploiting temporal structure", ICCV 2015☆48Nov 22, 2022Updated 3 years ago
- ☆967Sep 25, 2023Updated 2 years ago
- A Pytorch implementation of "Reconstruction Network for Video Captioning", CVPR 2018☆53Apr 6, 2020Updated 5 years ago
- Action recognition using soft attention based deep recurrent neural networks☆353Oct 30, 2016Updated 9 years ago
- pytorch implementation of video captioning☆399Aug 19, 2019Updated 6 years ago
- Simple Baseline for Visual Question Answering☆187Dec 21, 2016Updated 9 years ago
- PyTorch Implementation of Consensus-based Sequence Training for Video Captioning☆60May 15, 2018Updated 7 years ago
- Video to Language Challenge (MSR-VTT Challenge 2016)☆32Dec 28, 2017Updated 8 years ago
- Diagnostic tools and additional visualizations from "What Actions are Needed for Understanding Human Actions in Videos?" ICCV 2017☆88Dec 19, 2017Updated 8 years ago
- DPPnet: Image Question Answering using Convolutional Neural Network with Dynamic Parameter Prediction☆96Apr 20, 2016Updated 9 years ago
- Train a deeper LSTM and normalized CNN Visual Question Answering model. This current code can get 58.16 on OpenEnded and 63.09 on Multipl…☆387Mar 22, 2019Updated 6 years ago
- ☆33May 17, 2016Updated 9 years ago
- Code and Models for paper "Reinforced Video Captioning with Entailment Rewards (EMNLP 2017)"☆44Nov 19, 2019Updated 6 years ago
- Spatio-temporal video autoencoder with convolutional LSTMs☆294Jul 5, 2016Updated 9 years ago
- C3D is a modified version of BVLC caffe to support 3D ConvNets.☆1,182Jul 31, 2019Updated 6 years ago
- Implement Natural Language Object Retrieval in tensorflow☆35Nov 30, 2016Updated 9 years ago
- ☆16Dec 17, 2018Updated 7 years ago
- Attend Refine Repeat: Active Box Proposal Generation via In-Out Localization☆62Feb 12, 2019Updated 7 years ago
- This repository contains the code for a video captioning system inspired by Sequence to Sequence -- Video to Text. This system takes as i…☆172Oct 12, 2019Updated 6 years ago
- Evaluation code for Dense-Captioning Events in Videos☆130Jun 11, 2019Updated 6 years ago
- some models for video caption implemented by pytorch. (S2VT)☆23Feb 1, 2018Updated 8 years ago
- ☆1,217May 13, 2024Updated last year
- Dense image captioning in Torch☆1,599Jul 31, 2018Updated 7 years ago
- Repository for our CVPR 2017 and IJCV: TGIF-QA☆177Sep 6, 2021Updated 4 years ago
- Trajectory-pooled Deep-Convolutional Descriptors☆106Aug 24, 2017Updated 8 years ago
- Deep Networks with Stochastic Depth☆481Aug 13, 2018Updated 7 years ago
- Implementation of the image-sentence embedding method described in "Unifying Visual-Semantic Embeddings with Multimodal Neural Language M…☆427Feb 9, 2017Updated 9 years ago
- ☆218Aug 13, 2016Updated 9 years ago
- ☆33Apr 20, 2018Updated 7 years ago
- implement video caption based on openNMT☆36Apr 19, 2018Updated 7 years ago
- ☆115Aug 7, 2016Updated 9 years ago
- Code for paper "Exploring Models and Data for Image Question Answering"☆81Mar 23, 2016Updated 9 years ago