Rangozhang / VideoCaption
Video captioning using LSTM and CNN. This is the Visual Learning project done by Rui Zhang, Yujia Huang and Yu Zhang
☆20Updated 8 years ago
Related projects ⓘ
Alternatives and complementary repositories for VideoCaption
- The code for shuttleNet.☆31Updated 7 years ago
- Code for "Predictive-Corrective Networks for Action Detection"☆16Updated 6 years ago
- Some scripts used for action recognition on UCF101 dataset☆12Updated 8 years ago
- Diagnostic tools and additional visualizations from "What Actions are Needed for Understanding Human Actions in Videos?" ICCV 2017☆88Updated 6 years ago
- Real-time smart webcam in TensorFlow trained on the Charades dataset☆15Updated 6 years ago
- ☆60Updated 6 years ago
- Actionness Estimation Using Hybrid Fully Convolutional Networks☆30Updated 8 years ago
- Code for Temporal Relation Networks☆24Updated 6 years ago
- This repository is intended to host tools and demos for ActivityNet☆21Updated 7 years ago
- A tensorflow implementation of Generating Videos with Scene Dynamics☆6Updated 3 years ago
- Project Uncovering Temporal Context for Video Question and Answering☆15Updated 8 years ago
- FBN: Factorized Bilinear Models for Image Recognition (ICCV 2017)☆68Updated 6 years ago
- image caption with semantic attention☆12Updated 7 years ago
- Temporal augmentation with two-stream ConvNet features on human action recognition☆18Updated 7 years ago
- Localize objects in images using referring expressions☆37Updated 8 years ago
- source code for Finding Action Tubes, CVPR 2015☆65Updated 8 years ago
- Progressive Attention Networks☆13Updated 8 years ago
- An open source deep learning action recognition and segmentation framework☆51Updated 7 years ago
- Sentence/Caption evaluation using automated metrics☆61Updated 8 years ago
- P-CNN: Pose-based CNN Features for Action Recognition☆51Updated 6 years ago
- Multimodal Compact Bilinear Pooling for Torch7☆68Updated 7 years ago
- Code and models of paper " Chained Multi-stream Networks Exploiting Pose, Motion, and Appearance for Action Classification and Detection"…☆27Updated 6 years ago
- Torch implementation for Stacked Attention Networks☆24Updated 7 years ago
- Code for reproducing the results in "HICO: A Benchmark for Recognizing Human-Object Interactions in Images"☆39Updated 5 months ago
- Temporal action proposals☆47Updated 5 years ago
- Caffe implementation for Hu et al. Segmentation for Natural Language Expressions in arXiv:1603.06180, 2016 http://ronghanghu.com/text_obj…☆10Updated 8 years ago