malmaud / whats_cookin
Dataset generated by the methods in "What's Cookin'? Interpreting Cooking Videos using Text, Speech and Vision"
☆21Updated 9 years ago
Alternatives and similar repositories for whats_cookin
Users that are interested in whats_cookin are comparing it to the libraries listed below
Sorting:
- ☆11Updated 7 years ago
- The good practice in the VQA system such as pos-tag attention, structed triplet learning and triplet attention is very general and can be…☆19Updated 7 years ago
- Code for reproducing the results in "Mining Semantic Affordances of Visual Object Categories"☆11Updated 11 months ago
- ☆24Updated 8 years ago
- Code for the paper "Unsupervised Learning from Narrated Instruction Videos", CVPR2016☆19Updated 8 years ago
- imperative programming in TensorFlow☆18Updated 8 years ago
- Visual Verb Sense Disambiguation☆13Updated 6 years ago
- Egocentric Video Description based on Temporally-Linked Sequences☆11Updated 7 years ago
- Variational autoencoder in Theano☆12Updated 7 years ago
- Cornell House Agent Learning Environment☆47Updated 2 years ago
- Implement Natural Language Object Retrieval in tensorflow☆35Updated 8 years ago
- Generate a denotation graph from a set of image captions☆15Updated 6 years ago
- Convexified Convolutional Neural Networks☆15Updated 8 years ago
- BISON: Binary Image SelectiON☆49Updated 3 years ago
- ADVISE model for the Automatic Understanding of Visual Advertisements challenge. Please refer to our ECCV paper "ADVISE: Symbolism and Ex…☆26Updated 6 years ago
- Benchmark data and code for Question-Answering on Movie stories☆43Updated 5 years ago
- Referring expression comprehension on ReferIt(RefClef)☆10Updated 8 years ago
- Code for Unsupervised Discovery of Multimodal Links in Multi-Image/Multi-Sentence Documents☆30Updated 4 years ago
- Multi-Target Embodied Question Answering☆26Updated 4 years ago
- A lean, mean, very quickly deployable ExternalQuestion template for Amazon Mechanical Turk. Simplified as a static page.☆16Updated 8 years ago
- Models and Codes for the paper Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions☆14Updated 6 years ago
- Contains code for the EMNLP paper `Learning Linguistic Attributes for Zero-Shot Verb Classification'☆26Updated 7 years ago
- Code to replicate "Generating Visual Explanations"☆49Updated 4 years ago
- ☆18Updated 8 years ago
- Code for "So similar and yet incompatible: Toward the automated identification of semantically compatible words" in NAACL 2015 proceedi…☆11Updated 10 years ago
- ☆13Updated 7 years ago
- Transfer Learning via Unsupervised Task Discovery for Visual Question Answering☆19Updated 6 years ago
- a list of recent papers on transfer learning☆24Updated 7 years ago
- Website for TextVQA dataset.☆28Updated 2 years ago
- An implementation of the NAACL'18 paper "Punny Captions: Witty Wordplay in Image Descriptions".☆33Updated 6 years ago