hucvl / prn
Procedural Reasoning Networks
☆7Updated 3 years ago
Related projects:
- ☆44Updated 3 years ago
- ☆15Updated 3 months ago
- PyTorch version of VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer (NeurIPS 2021)☆56Updated last year
- [EMNLP 2021] Code and data for our paper "Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers…☆20Updated 2 years ago
- Code for EMNLP 2020 paper CoDIR☆41Updated last year
- Cross-modal Coherence Modeling for Caption Generation☆11Updated 4 years ago
- A dataset of crowdsourced ratings for machine-generated image captions☆31Updated 5 years ago
- Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.☆34Updated 3 years ago
- Visual Storytelling post-edit dataset☆17Updated 4 years ago
- Weakly-supervised action segmentation in video☆16Updated 2 years ago
- Evaluating and improving the faithfulness of the interpretations offered by Neural Module Networks☆13Updated last year
- Code for the AAAI 2020 paper "What Makes A Good Story? Designing Composite Rewards for Visual Storytelling"☆26Updated 3 years ago
- [EMNLP 2021] Code and data for our paper "Visually Grounded Reasoning across Languages and Cultures"☆28Updated 2 years ago
- Implementation of "MULE: Multimodal Universal Language Embedding"☆15Updated 4 years ago
- ☆19Updated this week
- Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": LXMERT…☆21Updated 3 years ago
- ☆20Updated last year
- Audio Visual Scene-Aware Dialog (AVSD) Challenge at the 10th Dialog System Technology Challenge (DSTC)☆27Updated 2 years ago
- LAReQA is a challenging benchmark for evaluating language agnostic answer retrieval from a multilingual candidate pool. This repository c…☆14Updated 4 years ago
- ☆13Updated 3 years ago
- Pre-trained V+L Data Preparation☆45Updated 4 years ago
- Code and data for the project "Visually grounded continual learning of compositional semantics"☆21Updated last year
- ☆45Updated last year
- Code for Unsupervised Discovery of Multimodal Links in Multi-Image/Multi-Sentence Documents☆30Updated 4 years ago
- Data for the ACL 2019 paper "Expressing Visual Relationships via Language"☆62Updated 3 years ago
- ☆13Updated this week
- Dataset and models for paper "Game-Based Video-Context Dialogue (EMNLP 2018)"☆19Updated 5 years ago
- Official GitHub repo for the Findings of EMNLP 2021 paper "An animated picture says at least a thousand words: Selecting Gif-based Replie…☆32Updated 2 years ago
- Implementation of Soft-Label Chain Conditional Random Field for Phrase Grounding in PyTorch☆15Updated last year
- INSET: Sentence Infilling with Inter-sentential Transformer☆30Updated 3 years ago