Rangozhang / VideoCaption
Video captioning using LSTM and CNN. This is the Visual Learning project done by Rui Zhang, Yujia Huang and Yu Zhang
☆19Updated 8 years ago
Alternatives and similar repositories for VideoCaption:
Users that are interested in VideoCaption are comparing it to the libraries listed below
- Some scripts used for action recognition on UCF101 dataset☆11Updated 9 years ago
- Code for "Predictive-Corrective Networks for Action Detection"☆16Updated 7 years ago
- Actionness Estimation Using Hybrid Fully Convolutional Networks☆30Updated 8 years ago
- The code for shuttleNet.☆31Updated 7 years ago
- Code for Temporal Relation Networks☆24Updated 7 years ago
- Implementation of the Budgeted Super Networks☆25Updated 5 years ago
- Torch implementation for Stacked Attention Networks☆23Updated 8 years ago
- image caption with semantic attention☆11Updated 7 years ago
- FBN: Factorized Bilinear Models for Image Recognition (ICCV 2017)☆68Updated 7 years ago
- ☆59Updated 7 years ago
- Attend Refine Repeat: Active Box Proposal Generation via In-Out Localization☆62Updated 6 years ago
- Code for "Objects as Context for Part Detection".☆23Updated 6 years ago
- ☆64Updated 7 years ago
- Diagnostic tools and additional visualizations from "What Actions are Needed for Understanding Human Actions in Videos?" ICCV 2017☆88Updated 7 years ago
- Caffe implementation for Hu et al. Segmentation for Natural Language Expressions in arXiv:1603.06180, 2016 http://ronghanghu.com/text_obj…☆9Updated 8 years ago
- source code for Finding Action Tubes, CVPR 2015☆64Updated 8 years ago
- Project Uncovering Temporal Context for Video Question and Answering☆14Updated 8 years ago
- Temporal action proposals☆47Updated 5 years ago
- source code for the paper "Hard-Aware-Deeply-Cascaed-Embedding"☆32Updated 8 years ago
- Caffe: a fast open framework for deep learning.☆16Updated 8 years ago
- An open source deep learning action recognition and segmentation framework☆51Updated 7 years ago
- P-CNN: Pose-based CNN Features for Action Recognition☆51Updated 7 years ago
- Torch implementation of "Multiple Object Recognition with Visual Attention" on Kaggle Cats vs Dogs dataset☆20Updated 8 years ago
- ☆29Updated 7 years ago
- Multimodal Compact Bilinear Pooling for Torch7☆68Updated 8 years ago
- Weakly Supervised Object Localization with Progressive Domain Adaptation (CVPR 2016)☆62Updated 8 years ago
- An attempt to implement the recurrent attention model (RAM) from "Recurrent Models of Visual Attention" (Mnih+ 2014)☆43Updated 4 years ago
- Object detection using AZ-Net☆44Updated 8 years ago
- Localize objects in images using referring expressions☆36Updated 8 years ago
- Disentangling Motion, Foreground and Background Features in Videos☆26Updated 7 years ago