jalayrac / instructionVideos
Code for the paper "Unsupervised Learning from Narrated Instruction Videos", CVPR2016
☆19 · Updated 8 years ago
Alternatives and similar repositories for instructionVideos:
Users who are interested in instructionVideos are comparing it to the repositories listed below.
- Code release for Hu et al., "Modeling Relationships in Referential Expressions with Compositional Modular Networks", CVPR 2017 ☆66 · Updated 6 years ago
- Localize objects in images using referring expressions ☆36 · Updated 8 years ago
- Charades Object Detection Dataset (ICCV 2017) ☆31 · Updated 6 years ago
- [COLING 2018] Learning Visually-Grounded Semantics from Contrastive Adversarial Samples ☆57 · Updated 5 years ago
- ☆88 · Updated 3 years ago
- PyTorch code for Reasoning Visual Dialogs with Structural and Partial Observations ☆42 · Updated 3 years ago
- Referring expression comprehension on ReferIt (RefClef) ☆9 · Updated 8 years ago
- Code release for Hu et al., "Explainable Neural Computation via Stack Neural Module Networks", ECCV 2018 ☆71 · Updated 5 years ago
- Code for an ECCV 2014 paper ☆12 · Updated 10 years ago
- GuessWhat?! Baselines ☆72 · Updated 2 years ago
- VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation ☆22 · Updated 7 years ago
- ☆11 · Updated 7 years ago
- [ACL 2019] Visually Grounded Neural Syntax Acquisition ☆89 · Updated 11 months ago
- Implementation of Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space ☆58 · Updated 6 years ago
- DSTC8-AVSD: Sentence generation task for Audio Visual Scene-aware Dialog ☆14 · Updated 3 years ago
- Source code for "Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction" ☆44 · Updated 7 months ago
- Repository to generate CLEVR-Dialog: A diagnostic dataset for Visual Dialog ☆46 · Updated 5 years ago
- Project Uncovering Temporal Context for Video Question and Answering ☆14 · Updated 8 years ago
- Visual Question Reasoning on General Dependency Tree ☆30 · Updated 6 years ago
- Pre-trained V+L Data Preparation ☆45 · Updated 4 years ago
- Sentence/Caption evaluation using automated metrics ☆60 · Updated 8 years ago
- Scene Graph Parsing as Dependency Parsing ☆41 · Updated 5 years ago
- Torch Implementation of Speaker-Listener-Reinforcer for Referring Expression Generation and Comprehension ☆33 · Updated 6 years ago
- Code for "A Simple Baseline for Audio-Visual Scene-Aware Dialog" ☆25 · Updated 4 years ago
- Code for Learning to Learn Language from Narrated Video ☆33 · Updated last year
- Generate a denotation graph from a set of image captions ☆15 · Updated 6 years ago
- Visual Storytelling API ☆35 · Updated 8 years ago
- Visual Verb Sense Disambiguation ☆13 · Updated 5 years ago
- Adds SPICE metric to coco-caption evaluation server codes ☆49 · Updated 2 years ago
- Hadamard Product for Low-rank Bilinear Pooling ☆70 · Updated 7 years ago