Singularity42 / cap2vid
Attentive Semantic Video Generation using Captions
☆35Updated 7 years ago
Alternatives and similar repositories for cap2vid
Users that are interested in cap2vid are comparing it to the libraries listed below
Sorting:
- Hierarchical Video Generation from Orthogonal Information: Optical Flow and Texture (AAAI-18)☆32Updated 6 years ago
- The source code of ECCV18 'Flow-Grounded Spatial-Temporal Video Prediction from Still Images'.☆60Updated 6 years ago
- Tensorflow implementation of the ICML 2017 paper: Learning to Generate Long-term Future via Hierarchical Prediction☆74Updated 6 years ago
- Code for CVPR'18 "Grounding Referring Expressions in Images by Variational Context"☆30Updated 6 years ago
- Diagnostic tools and additional visualizations from "What Actions are Needed for Understanding Human Actions in Videos?" ICCV 2017☆88Updated 7 years ago
- An open source deep learning action recognition and segmentation framework☆51Updated 7 years ago
- Tensorflow implement of paper: Optimization of image description metrics using policy gradient methods☆29Updated 6 years ago
- ☆61Updated 6 years ago
- Code for the paper Action Sets: Weakly Supervised Action Segmentation without Ordering Constraints☆30Updated 4 years ago
- The implementation of Temporal Generative Adversarial Nets with Singular Value Clipping☆79Updated 4 years ago
- Charades Object Detection Dataset (ICCV 2017)☆31Updated 6 years ago
- Given the previous frames of the video as input, we want to get the long-term frame prediction.☆32Updated 7 years ago
- CVPR18: Learning and Using the Arrow of Time☆21Updated 6 years ago
- Toolkit for the VLOG dataset☆37Updated 7 years ago
- Implementation of Hierarchical Long-term Video Prediction without Supervision☆90Updated 3 years ago
- Code for training temporal fully-connected CRF models in Torch☆68Updated 6 years ago
- ☆105Updated 7 years ago
- VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation☆22Updated 7 years ago
- Localize objects in images using referring expressions☆37Updated 8 years ago
- Source code for the paper "Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training"☆66Updated 6 years ago
- ☆47Updated 7 years ago
- Visual Dynamics: Probabilistic Future Frame Synthesis via Cross Convolutional Networks☆74Updated 7 years ago
- Official code for paper Context-aware Zero-shot Recognition (https://arxiv.org/abs/1904.09320 to appear at AAAI2020)☆58Updated 5 years ago
- Code for Temporal Relation Networks☆24Updated 7 years ago
- Code for "Controllable Video Generation with Sparse Trajectories" in PyTorch☆45Updated 6 years ago
- Weakly-Supervised Action Segmentation with Iterative Soft Boundary Assignment (CVPR 2018)☆41Updated 7 years ago
- Im2Flow: Motion Hallucination from Static Images for Action Recognition (CVPR 2018)☆54Updated 6 years ago
- Video captioning using LSTM and CNN. This is the Visual Learning project done by Rui Zhang, Yujia Huang and Yu Zhang☆19Updated 9 years ago
- Project Uncovering Temporal Context for Video Question and Answering☆14Updated 9 years ago
- Repository for MarioQA: Answering Questions by Watching Gameplay Videos in ICCV 2017☆10Updated 7 years ago