A2Zadeh / Social-IQ
☆38Updated this week
Related projects: ⓘ
- [EMNLP 2020] What is More Likely to Happen Next? Video-and-Language Future Event Prediction☆47Updated 2 years ago
- Code for the model "Heterogeneous Graph Learning for Visual Commonsense Reasoning (NeurlPS 2019)"☆46Updated 4 years ago
- Neural State Machine implemented in PyTorch☆70Updated 4 years ago
- Code for ''A Simple Baseline for Audio-Visual Scene-Aware Dialog``☆25Updated 4 years ago
- ☆45Updated last year
- Repository to generate CLEVR-Dialog: A diagnostic dataset for Visual Dialog☆44Updated 4 years ago
- ☆29Updated 2 years ago
- VisualCOMET: Reasoning about the Dynamic Context of a Still Image☆85Updated last year
- Code and dataset of "MEmoR: A Dataset for Multimodal Emotion Reasoning in Videos" in MM'20.☆50Updated last year
- [ICLR 2019] Learning Factorized Multimodal Representations☆65Updated 4 years ago
- ☆40Updated last year
- NeurIPS 2019 Paper: RUBi : Reducing Unimodal Biases for Visual Question Answering☆59Updated 3 years ago
- ☆40Updated this week
- Multi-sense word embeddings from visual co-occurrences☆25Updated 5 years ago
- Pytorch implementation for our NeurIPS 2019 paper "TAB-VCR: Tags and Attributes based VCR Baselines" https://arxiv.org/abs/1910.14671☆19Updated 3 years ago
- [CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)☆57Updated 3 years ago
- Shows visual grounding methods can be right for the wrong reasons! (ACL 2020)☆23Updated 4 years ago
- A one-stop shop for YouCook2 info such as leaderboard and recent advances on (cooking) video retrieval and captioning.☆37Updated 2 years ago
- Visual Dialog: Light-weight Transformer for Many Inputs (ECCV 2020)☆30Updated 3 years ago
- ☆15Updated 5 years ago
- Code for WACV 2021 Paper "Meta Module Network for Compositional Visual Reasoning"☆43Updated 3 years ago
- Implementation for "Large-scale Pretraining for Visual Dialog" https://arxiv.org/abs/1912.02379☆95Updated 4 years ago
- Data Release for VALUE Benchmark☆32Updated 2 years ago
- Pytorch version of VidLanKD: Improving Language Understanding viaVideo-Distilled Knowledge Transfer (NeurIPS 2021))☆56Updated last year
- Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": LXMERT…☆21Updated 3 years ago
- ☆29Updated 3 years ago
- Repo for ICCV 2021 paper: Beyond Question-Based Biases: Assessing Multimodal Shortcut Learning in Visual Question Answering☆24Updated 2 months ago
- Connective Cognition Network for Directional Visual Commonsense Reasoning☆15Updated 3 years ago
- DSTC8-AVSD: Sentence generation task for Audio Visual Scene-aware Dialog☆14Updated 3 years ago
- Diverse Image Captioning with Context-Object Split Latent Spaces (NeurIPS 2020)☆37Updated 2 years ago