automatic video description generation with GPU training
☆256Jan 12, 2020Updated 6 years ago
Alternatives and similar repositories for arctic-capgen-vid
Users that are interested in arctic-capgen-vid are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Caffe☆205Oct 9, 2017Updated 8 years ago
- Soft attention mechanism for video caption generation☆154Jul 17, 2017Updated 8 years ago
- [ACM MM 2017 & IEEE TMM 2020] This is the Theano code for the paper "Video Description with Spatial Temporal Attention"☆61Oct 20, 2020Updated 5 years ago
- Implementation of "Sequence to Sequence – Video to Text"☆266Apr 8, 2017Updated 9 years ago
- Using Semantic Compositional Networks for Video Captioning☆96Nov 27, 2018Updated 7 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Sentence/Caption evaluation using automated metrics☆61May 5, 2016Updated 10 years ago
- A Pytorch implementation of "describing videos by exploiting temporal structure", ICCV 2015☆48Nov 22, 2022Updated 3 years ago
- PyTorch Implementation of Consensus-based Sequence Training for Video Captioning☆60May 15, 2018Updated 7 years ago
- Code and Models for paper "Reinforced Video Captioning with Entailment Rewards (EMNLP 2017)"☆44Nov 19, 2019Updated 6 years ago
- pytorch implementation of video captioning☆400Aug 19, 2019Updated 6 years ago
- A Pytorch implementation of "Reconstruction Network for Video Captioning", CVPR 2018☆53Apr 6, 2020Updated 6 years ago
- some models for video caption implemented by pytorch. (S2VT)☆23Feb 1, 2018Updated 8 years ago
- Video to Language Challenge (MSR-VTT Challenge 2016)☆32Dec 28, 2017Updated 8 years ago
- implement video caption based on openNMT☆36Apr 19, 2018Updated 8 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- This repository contains the code for a video captioning system inspired by Sequence to Sequence -- Video to Text. This system takes as i…☆171Oct 12, 2019Updated 6 years ago
- Implementation of caption-image retrieval from the paper "Order-Embeddings of Images and Language"☆189Oct 13, 2016Updated 9 years ago
- ☆16Dec 17, 2018Updated 7 years ago
- ☆966Sep 25, 2023Updated 2 years ago
- Action recognition using soft attention based deep recurrent neural networks☆354Oct 30, 2016Updated 9 years ago
- Diagnostic tools and additional visualizations from "What Actions are Needed for Understanding Human Actions in Videos?" ICCV 2017☆88Dec 19, 2017Updated 8 years ago
- Simple Baseline for Visual Question Answering☆186Dec 21, 2016Updated 9 years ago
- Implementation of "Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning" (https://arxiv.…☆26Nov 3, 2018Updated 7 years ago
- ☆33Apr 20, 2018Updated 8 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- C3D is a modified version of BVLC caffe to support 3D ConvNets.☆1,184Jul 31, 2019Updated 6 years ago
- DPPnet: Image Question Answering using Convolutional Neural Network with Dynamic Parameter Prediction☆96Apr 20, 2016Updated 10 years ago
- The Theano code for the CVPR 2017 paper "Semantic Compositional Networks for Visual Captioning"☆68Mar 26, 2018Updated 8 years ago
- ☆1,219May 13, 2024Updated last year
- with reinforcement learning☆32May 19, 2020Updated 5 years ago
- ICCV2019: Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network☆68Nov 19, 2019Updated 6 years ago
- Spatio-temporal video autoencoder with convolutional LSTMs☆293Jul 5, 2016Updated 9 years ago
- Trajectory-pooled Deep-Convolutional Descriptors☆106Aug 24, 2017Updated 8 years ago
- Implement Natural Language Object Retrieval in tensorflow☆35Nov 30, 2016Updated 9 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Train a deeper LSTM and normalized CNN Visual Question Answering model. This current code can get 58.16 on OpenEnded and 63.09 on Multipl…☆386Mar 22, 2019Updated 7 years ago
- Repository for our CVPR 2017 and IJCV: TGIF-QA☆179Sep 6, 2021Updated 4 years ago
- Extension of hLSTMat☆19Apr 15, 2021Updated 5 years ago
- Video Grounding and Captioning☆332Oct 12, 2021Updated 4 years ago
- Evaluation code for Dense-Captioning Events in Videos☆130Jun 11, 2019Updated 6 years ago
- ☆33May 17, 2016Updated 9 years ago
- A video captioning tool using S2VT method and attention mechanism (TensorFlow)☆15Oct 14, 2018Updated 7 years ago