malmaud / whats_cookin
Dataset generated by the methods in "What's Cookin'? Interpreting Cooking Videos using Text, Speech and Vision"
☆20Updated 9 years ago
Related projects ⓘ
Alternatives and complementary repositories for whats_cookin
- Implement Natural Language Object Retrieval in tensorflow☆36Updated 7 years ago
- ☆24Updated 7 years ago
- Code for reproducing the results in "Mining Semantic Affordances of Visual Object Categories"☆10Updated 5 months ago
- Visual Storytelling API☆35Updated 7 years ago
- Localize objects in images using referring expressions☆37Updated 8 years ago
- Code for Unsupervised Discovery of Multimodal Links in Multi-Image/Multi-Sentence Documents☆30Updated 4 years ago
- Variational autoencoder in Theano☆12Updated 7 years ago
- ☆11Updated 7 years ago
- Referring expression comprehension on ReferIt(RefClef)☆10Updated 7 years ago
- Generate a denotation graph from a set of image captions☆15Updated 6 years ago
- Visual question answering for CVPR16 VQA Challenge.☆41Updated 8 years ago
- An implementation of the NAACL'18 paper "Punny Captions: Witty Wordplay in Image Descriptions".☆33Updated 6 years ago
- Benchmark data and code for Question-Answering on Movie stories☆43Updated 4 years ago
- caption images w/ visual attn☆9Updated 7 years ago
- Visual Verb Sense Disambiguation☆13Updated 5 years ago
- Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering☆25Updated 4 years ago
- Cornell House Agent Learning Environment☆47Updated 2 years ago
- Website for TextVQA dataset.☆28Updated last year
- Representations of language in a model of visually grounded speech signal.☆23Updated 6 years ago
- Code for the paper "Unsupervised Learning from Narrated Instruction Videos", CVPR2016☆19Updated 8 years ago
- A lean, mean, very quickly deployable ExternalQuestion template for Amazon Mechanical Turk. Simplified as a static page.☆16Updated 7 years ago
- ☆11Updated 7 years ago
- Repository to generate CLEVR-Dialog: A diagnostic dataset for Visual Dialog☆44Updated 4 years ago
- ☆48Updated last year
- GuessWhat?! Baselines☆73Updated 2 years ago
- Torch implementation for Stacked Attention Networks☆24Updated 8 years ago
- Implements an MLP for VQA☆8Updated 8 years ago