Audio Visual Scene-Aware Dialog (AVSD) Challenge at the 10th Dialog System Technology Challenge (DSTC)
☆27 · Aug 19, 2022 · Updated 3 years ago
Alternatives and similar repositories for AVSD-DSTC10_Official
Users that are interested in AVSD-DSTC10_Official are comparing it to the libraries listed below.
- ☆27 · May 4, 2020 · Updated 6 years ago
- ☆16 · May 6, 2021 · Updated 5 years ago
- DSTC10 Track 2 - Knowledge-grounded Task-oriented Dialogue Modeling on Spoken Conversations ☆62 · Jul 25, 2023 · Updated 2 years ago
- pre-trained vision and language model summary ☆12 · Apr 20, 2021 · Updated 5 years ago
- Code for SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations ☆108 · Nov 12, 2022 · Updated 3 years ago
- Code for the paper BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues (EMNLP20) ☆11 · Jun 16, 2025 · Updated last year
- We rank the 1st in DSTC8 Audio-Visual Scene-Aware Dialog competition. This is the source code for our IEEE/ACM TASLP (AAAI2020-DSTC8-AVSD… ☆56 · Jun 12, 2023 · Updated 3 years ago
- ☆54 · Nov 18, 2019 · Updated 6 years ago
- ☆18 · Apr 11, 2021 · Updated 5 years ago
- Code release for "MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound" ☆146 · Jun 1, 2022 · Updated 4 years ago
- ☆13 · Sep 25, 2024 · Updated last year
- DSTC10 Track1 - MOD: Internet Meme Incorporated Open-domain Dialog ☆51 · Feb 16, 2023 · Updated 3 years ago
- Models for the Collaborative Drawing (CoDraw) task ☆14 · Jan 15, 2019 · Updated 7 years ago
- Code for the paper Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems (ACL19) ☆100 · Oct 17, 2022 · Updated 3 years ago
- Official Code Release for Container: Context Aggregation Network ☆46 · Oct 17, 2021 · Updated 4 years ago
- PyTorch implementation of the Reinforced Mnemonic Reader + Answer Verifier model (https://arxiv.org/abs/1808.05759) ☆10 · Nov 23, 2018 · Updated 7 years ago
- DSTC9 Track 1 - Beyond Domain APIs: Task-oriented Conversational Modeling with Unstructured Knowledge Access ☆106 · Jun 12, 2023 · Updated 3 years ago
- ☆57 · Oct 17, 2021 · Updated 4 years ago
- SelecMix: Debiased Learning by Contradicting-pair Sampling (NeurIPS 2022) ☆13 · Jun 5, 2024 · Updated 2 years ago
- Implementation for CVPR 2020 Paper "Two Causal Principles for Improving Visual Dialog" ☆31 · Feb 19, 2023 · Updated 3 years ago
- The first unofficial implementation of CLIP4Caption: CLIP for Video Caption (ACMMM 2021) ☆16 · Jan 2, 2023 · Updated 3 years ago
- Source Code for Captionomaly: A Deep Learning Toolbox for Anomaly Captioning in Surveillance Videos ☆13 · Jun 26, 2023 · Updated 2 years ago
- ☆59 · Feb 20, 2022 · Updated 4 years ago
- A package for Hangul (Korean alphabet) ☆13 · Dec 19, 2022 · Updated 3 years ago
- [ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection ☆25 · May 18, 2023 · Updated 3 years ago
- Official code for "Rethinking Chain-of-Thought Reasoning for Videos" ☆21 · Dec 14, 2025 · Updated 6 months ago
- ☆14 · Dec 25, 2020 · Updated 5 years ago
- Training and evaluation codes for the BertGen paper (ACL-IJCNLP 2021) ☆11 · Sep 17, 2023 · Updated 2 years ago
- ✨ Official PyTorch Implementation for EMNLP'19 Paper, "Dual Attention Networks for Visual Reference Resolution in Visual Dialog" ☆44 · Mar 19, 2023 · Updated 3 years ago
- Information-oriented Metric (IOM) ☆11 · Sep 2, 2020 · Updated 5 years ago
- ☆10 · Jun 28, 2023 · Updated 2 years ago
- ☆18 · Jun 10, 2024 · Updated 2 years ago
- A structurally comprehensive dataset of AMR-to-text alignments for coverage of a larger variety of linguistic phenomena, for research rel… ☆16 · Dec 10, 2022 · Updated 3 years ago
- Code for ACL 2020 paper "Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA." Hyounghun Kim, Zineng T… ☆34 · May 14, 2020 · Updated 6 years ago
- K-Means algorithm in the Poincare Disk Model ☆16 · Nov 12, 2018 · Updated 7 years ago
- [AAAI 2023 Oral] VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning ☆68 · Feb 16, 2024 · Updated 2 years ago
- LaTeX template for CUHK PhD Thesis ☆14 · Jun 29, 2025 · Updated 11 months ago
- PyTorch code for Learning to Caption Images through a Lifetime by Asking Questions (ICCV 2019) ☆16 · Sep 17, 2019 · Updated 6 years ago
- ☆21 · Sep 10, 2021 · Updated 4 years ago