Rangozhang / VideoCaption
Video captioning using LSTM and CNN. This is the Visual Learning project done by Rui Zhang, Yujia Huang and Yu Zhang
☆19Updated 9 years ago
Alternatives and similar repositories for VideoCaption:
Users that are interested in VideoCaption are comparing it to the libraries listed below
- Code for "Predictive-Corrective Networks for Action Detection"☆16Updated 7 years ago
- Caffe implementation for Hu et al. Segmentation for Natural Language Expressions in arXiv:1603.06180, 2016 http://ronghanghu.com/text_obj…☆9Updated 8 years ago
- Actionness Estimation Using Hybrid Fully Convolutional Networks☆30Updated 8 years ago
- Some scripts used for action recognition on UCF101 dataset☆11Updated 9 years ago
- An open source deep learning action recognition and segmentation framework☆51Updated 7 years ago
- Diagnostic tools and additional visualizations from "What Actions are Needed for Understanding Human Actions in Videos?" ICCV 2017☆88Updated 7 years ago
- Implementation of the Budgeted Super Networks☆25Updated 6 years ago
- Charades Object Detection Dataset (ICCV 2017)☆31Updated 6 years ago
- Torch implementation of "Multiple Object Recognition with Visual Attention" on Kaggle Cats vs Dogs dataset☆20Updated 8 years ago
- Given the previous frames of the video as input, we want to get the long-term frame prediction.☆32Updated 7 years ago
- Localize objects in images using referring expressions☆37Updated 8 years ago
- Code for Temporal Relation Networks☆24Updated 7 years ago
- Disentangling Motion, Foreground and Background Features in Videos☆26Updated 7 years ago
- Weakly Supervised Object Localization with Progressive Domain Adaptation (CVPR 2016)☆62Updated 8 years ago
- This repository is intended to host tools and demos for ActivityNet☆21Updated 8 years ago
- Real-time smart webcam in TensorFlow trained on the Charades dataset☆15Updated 7 years ago
- Attend Refine Repeat: Active Box Proposal Generation via In-Out Localization☆62Updated 6 years ago
- The code for shuttleNet.☆31Updated 7 years ago
- codes for ECCV 2016☆9Updated 7 years ago
- source code for Finding Action Tubes, CVPR 2015☆64Updated 8 years ago
- FBN: Factorized Bilinear Models for Image Recognition (ICCV 2017)☆68Updated 7 years ago
- The source code for Temporal Attention-Gated Model.☆21Updated 7 years ago
- Toolkit for the VLOG dataset☆37Updated 7 years ago
- My implementation (PyTorch) for the paper SST: Single-Stream Temporal Action Proposals (http://vision.stanford.edu/pdf/buch2017cvpr.pdf).☆10Updated 2 years ago
- DualNet: Learn Complementary Features for Image Recognition☆19Updated 7 years ago
- P-CNN: Pose-based CNN Features for Action Recognition☆52Updated 7 years ago
- Caffe☆32Updated 7 years ago
- source code for the paper "Hard-Aware-Deeply-Cascaed-Embedding"☆32Updated 8 years ago
- ☆59Updated 7 years ago
- image caption with semantic attention☆11Updated 8 years ago