Attentive Semantic Video Generation using Captions
☆36Oct 22, 2017Updated 8 years ago
Alternatives and similar repositories for cap2vid
Users that are interested in cap2vid are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Sync-DRAW: Automatic Video Generation using Deep Recurrent Attentive Architectures☆12Oct 21, 2017Updated 8 years ago
- A video captioning tool using S2VT method and attention mechanism (TensorFlow)☆15Oct 14, 2018Updated 7 years ago
- ☆16Dec 17, 2018Updated 7 years ago
- Extension of hLSTMat☆19Apr 15, 2021Updated 5 years ago
- The paper of "Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning" accepted in International Joint Conference on Arti…☆16Jun 29, 2017Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆18Feb 21, 2022Updated 4 years ago
- Experience-embedded Visual Foresight, CoRL 2019☆14Nov 13, 2019Updated 6 years ago
- implement video caption based on openNMT☆36Apr 19, 2018Updated 8 years ago
- Code for "Controllable Video Generation with Sparse Trajectories" in PyTorch☆45May 14, 2018Updated 8 years ago
- Online Motion Retargeting☆14Nov 17, 2012Updated 13 years ago
- Tensorflow implementation of the ICML 2017 paper: Learning to Generate Long-term Future via Hierarchical Prediction☆74Oct 3, 2018Updated 7 years ago
- Unsupervised Perceptual Rewards for Imitation Learning☆10Feb 3, 2018Updated 8 years ago
- A pytorch implemention of MoCoGAN☆103Oct 18, 2017Updated 8 years ago
- ICNet in TensorFlow, Real-Time Segmentation☆10Aug 17, 2018Updated 7 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- implementation of TDConvED for video captioning☆13Mar 18, 2020Updated 6 years ago
- [ACM MM 2017 & IEEE TMM 2020] This is the Theano code for the paper "Video Description with Spatial Temporal Attention"☆61Oct 20, 2020Updated 5 years ago
- ☆19Nov 25, 2022Updated 3 years ago
- Perspective Transformer Layer☆18Nov 28, 2016Updated 9 years ago
- PyTorch code for the Findings of EMNLP 2021 paper "Does Vision-and-Language Pretraining Improve Lexical Grounding?"☆11Sep 26, 2021Updated 4 years ago
- A pytorch implementation of a text to videos GAN☆12Jul 26, 2019Updated 6 years ago
- Soft attention mechanism for video caption generation☆154Jul 17, 2017Updated 8 years ago
- PyTorch implementation of "Sample- and Parameter-Efficient Auto-Regressive Image Models" from CVPR 2025☆14Nov 21, 2025Updated 6 months ago
- The implementation of Temporal Generative Adversarial Nets with Singular Value Clipping☆79Jul 7, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [CVPR 2019] Text2Scene: Generating Compositional Scenes from Textual Descriptions☆119Apr 15, 2022Updated 4 years ago
- [AAAI 2019] A Layer-Based Sequential Framework for Scene Generation with GANs☆40May 25, 2020Updated 5 years ago
- PyTorch Implementation of Consensus-based Sequence Training for Video Captioning☆60May 15, 2018Updated 8 years ago
- Author implementation of "Contextualized Word Representations for Reading Comprehension" (Salant et al. 2017)☆11Jun 14, 2018Updated 7 years ago
- Official repository for Do Pre-trained Models Benefit Equally in Continual Learning? (Accepted to WACV'23)☆26Oct 20, 2024Updated last year
- Code for the COG dataset and network☆44Oct 17, 2018Updated 7 years ago
- ☆40Oct 29, 2025Updated 6 months ago
- ☆14Jan 30, 2017Updated 9 years ago
- Video Action Classification Using Spatial Temporal Clues. Original paper: arXiv:1504.01561☆23Jun 8, 2018Updated 7 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆14Mar 31, 2022Updated 4 years ago
- Implementation of 'A Neural Compositional Paradigm for Image Captioning' by B. Dai, S.Fidler, D. Lin☆12Mar 15, 2019Updated 7 years ago
- Project Uncovering Temporal Context for Video Question and Answering☆14Apr 12, 2016Updated 10 years ago
- This repository contains some simple and useful scripts that can be helpful for handling data☆11May 8, 2023Updated 3 years ago
- 🎬 Video Captioning: ICCV '15 paper implementation☆47May 30, 2018Updated 7 years ago
- ☆67Jun 25, 2021Updated 4 years ago
- ☆20Sep 19, 2019Updated 6 years ago