Singularity42 / cap2vid
Attentive Semantic Video Generation using Captions
☆36Updated 7 years ago
Alternatives and similar repositories for cap2vid:
Users who are interested in cap2vid are comparing it to the repositories listed below:
- Tensorflow implementation of the ICML 2017 paper: Learning to Generate Long-term Future via Hierarchical Prediction☆74Updated 6 years ago
- Code for CVPR'18 "Grounding Referring Expressions in Images by Variational Context"☆29Updated 6 years ago
- Charades Object Detection Dataset (ICCV 2017)☆31Updated 6 years ago
- TensorFlow implementation of the paper: Optimization of image description metrics using policy gradient methods☆29Updated 6 years ago
- Diagnostic tools and additional visualizations from "What Actions are Needed for Understanding Human Actions in Videos?" ICCV 2017☆88Updated 7 years ago
- ActorObserverNet code in PyTorch from "Actor and Observer: Joint Modeling of First and Third-Person Videos", CVPR 2018☆77Updated 5 years ago
- PyTorch implementation of "Object level Visual Reasoning in Videos", F. Baradel, N. Neverova, C. Wolf, J. Mille, G. Mori, ECCV 2018☆172Updated 6 years ago
- Rethinking Diversified and Discriminative Proposal Generation for Visual Grounding☆22Updated 6 years ago
- Weakly-Supervised Action Segmentation with Iterative Soft Boundary Assignment (CVPR 2018)☆41Updated 6 years ago
- Code for the paper Action Sets: Weakly Supervised Action Segmentation without Ordering Constraints☆30Updated 4 years ago
- PyTorch code for Reasoning Visual Dialogs with Structural and Partial Observations☆42Updated 3 years ago
- Deep Future Gaze: Gaze Anticipation on Egocentric Videos Using Adversarial Networks☆32Updated 4 years ago
- VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation☆22Updated 7 years ago
- Hierarchical Video Generation from Orthogonal Information: Optical Flow and Texture (AAAI-18)☆32Updated 6 years ago
- Implementation of Hierarchical Long-term Video Prediction without Supervision☆90Updated 3 years ago
- Source code for paper "Towards Automatic Learning of Procedures from Web Instructional Videos"☆34Updated 6 years ago
- Localize objects in images using referring expressions☆37Updated 8 years ago
- Code for training temporal fully-connected CRF models in Torch☆68Updated 6 years ago
- Repository for MarioQA: Answering Questions by Watching Gameplay Videos in ICCV 2017☆10Updated 7 years ago
- Code release for Hu et al., "Modeling Relationships in Referential Expressions with Compositional Modular Networks", CVPR 2017☆67Updated 6 years ago
- The source code of ECCV18 'Flow-Grounded Spatial-Temporal Video Prediction from Still Images'.☆60Updated 5 years ago
- The implementation of Temporal Generative Adversarial Nets with Singular Value Clipping☆78Updated 4 years ago
- Code and training data for our ECCV 2016 paper on Unsupervised Learning☆45Updated 3 years ago
- Supplementary material to "Top-down Visual Saliency Guided by Captions" (CVPR 2017)☆108Updated 6 years ago
- Source code for the paper "Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training"☆66Updated 5 years ago
- Code for the CVPR 2020 paper 'Action Modifiers: Learning from Adverbs in Instructional Videos'☆22Updated 3 years ago