Code for DVD A Diagnostic Dataset for Multi-step Reasoning in Video Grounded Dialogue
☆14Oct 12, 2021Updated 4 years ago
Alternatives and similar repositories for DVDialogues
Users that are interested in DVDialogues are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆27May 4, 2020Updated 5 years ago
- An open-source framework for modeling real-time conversations in spoken dialogue systems.☆27Aug 12, 2022Updated 3 years ago
- Code for the paper Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems (ACL19)☆100Oct 17, 2022Updated 3 years ago
- Code for the paper Non-Autoregressive Dialog State Tracking (ICLR20)☆44Feb 25, 2020Updated 6 years ago
- A tool to inform citizens of Berlin about changes in their neighborhood.☆21Nov 12, 2023Updated 2 years ago
- Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog"☆13May 12, 2023Updated 2 years ago
- ☆20Jul 27, 2020Updated 5 years ago
- A Corpus of Natural Language Instructions for Collaborative Manipulation☆15Feb 15, 2017Updated 9 years ago
- Source code for paper "VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution"☆10Nov 1, 2022Updated 3 years ago
- Code for the paper "A Divide-and-Conquer Approach to the Summarization of Long Documents"☆18Jun 8, 2021Updated 4 years ago
- Code for ACL 2020 paper "Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA." Hyounghun Kim, Zineng T…☆34May 14, 2020Updated 5 years ago
- Implementation of the AAAI-21 Workshop on Scientific Document Understanding paper "A Paragraph-level Multi-task Learning Model for Scient…☆15Oct 9, 2023Updated 2 years ago
- Code for the paper BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues (EMNLP20)☆11Jun 16, 2025Updated 9 months ago
- DSTC8-AVSD: Sentence generation task for Audio Visual Scene-aware Dialog☆14Jun 10, 2021Updated 4 years ago
- Deep Multi-Speech model☆11Jul 25, 2018Updated 7 years ago
- Your personalized retrieval engine☆29Jan 4, 2022Updated 4 years ago
- Model of three-way decision☆13Jun 12, 2020Updated 5 years ago
- a close enough approximation of the shadertoy framework☆12Jul 2, 2020Updated 5 years ago
- ☆54Nov 18, 2019Updated 6 years ago
- End-to-end Multi-modal Video Temporal Grounding, NeurIPS 2021☆18Oct 24, 2021Updated 4 years ago
- ☆15Aug 20, 2024Updated last year
- Source Code for Captionomaly: A Deep Learning Toolbox for Anomaly Captioning in Surveillance Videos☆13Jun 26, 2023Updated 2 years ago
- Repo from the "Learning with limited labeled data" seminar @ Uni of Tuebingen. A collection of notes, notebooks and slideshows to underst…☆17Apr 13, 2023Updated 2 years ago
- A package for Hangul (korean alphabet)☆13Dec 19, 2022Updated 3 years ago
- A simple pyaudio microphone interface☆11Jul 27, 2018Updated 7 years ago
- ☆10Nov 2, 2023Updated 2 years ago
- Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data.☆10Feb 27, 2024Updated 2 years ago
- PyTorch implementation of "PatchVAE: Learning Local Latent Codes for Recognition" to appear in CVPR 2020☆14Apr 9, 2020Updated 5 years ago
- ☆13Jun 7, 2024Updated last year
- ☆14Jun 18, 2025Updated 9 months ago
- Dataset and models for paper "Game-Based Video-Context Dialogue (EMNLP 2018)"☆19Oct 25, 2018Updated 7 years ago
- Sort vim folds based on their first line.☆13May 19, 2019Updated 6 years ago
- An initiative to collect and distribute resources for co-reference resolution in a unified standard.☆26May 12, 2024Updated last year
- A structurally comprehensive dataset of AMR-to-text alignments for coverage of a larger variety of linguistic phenomena, for research rel…☆16Dec 10, 2022Updated 3 years ago
- We rank the 1st in DSTC8 Audio-Visual Scene-Aware Dialog competition. This is the source code for our IEEE/ACM TASLP (AAAI2020-DSTC8-AVSD…☆56Jun 12, 2023Updated 2 years ago
- ROCK model for Knowledge-Based VQA in Videos☆31Oct 19, 2020Updated 5 years ago
- Multi-faceted Video Moment Localizer☆17Jun 19, 2020Updated 5 years ago
- ☆11Dec 20, 2023Updated 2 years ago
- "Describing Textures using Natural Language" code and data, ECCV 2020 Oral.☆17Aug 6, 2020Updated 5 years ago