Audio Visual Scene-Aware Dialog (AVSD) Challenge at the 10th Dialog System Technology Challenge (DSTC)
☆27Aug 19, 2022Updated 3 years ago
Alternatives and similar repositories for AVSD-DSTC10_Official
Users that are interested in AVSD-DSTC10_Official are comparing it to the libraries listed below
Sorting:
- ☆27May 4, 2020Updated 5 years ago
- DSTC10 Track 2 - Knowledge-grounded Task-oriented Dialogue Modeling on Spoken Conversations☆62Jul 25, 2023Updated 2 years ago
- We rank the 1st in DSTC8 Audio-Visual Scene-Aware Dialog competition. This is the source code for our IEEE/ACM TASLP (AAAI2020-DSTC8-AVSD…☆56Jun 12, 2023Updated 2 years ago
- ☆54Nov 18, 2019Updated 6 years ago
- Code for the paper BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues (EMNLP20)☆11Jun 16, 2025Updated 8 months ago
- ☆13Sep 25, 2024Updated last year
- Code for SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations☆106Nov 12, 2022Updated 3 years ago
- ☆18Apr 11, 2021Updated 4 years ago
- ☆16May 6, 2021Updated 4 years ago
- [ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection☆25May 18, 2023Updated 2 years ago
- DSTC10 Track1 - MOD: Internet Meme Incorporated Open-domain Dialog☆50Feb 16, 2023Updated 3 years ago
- Online (real-time) decoder to be used with DeepSpeech2 model☆25Feb 27, 2020Updated 6 years ago
- Code release for "MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound"☆146Jun 1, 2022Updated 3 years ago
- Code for the paper Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems (ACL19)☆100Oct 17, 2022Updated 3 years ago
- DSTC9 Track 1 - Beyond Domain APIs: Task-oriented Conversational Modeling with Unstructured Knowledge Access☆105Jun 12, 2023Updated 2 years ago
- PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023)☆34Feb 5, 2023Updated 3 years ago
- ☆31Jun 19, 2020Updated 5 years ago
- ACL 2020 Website☆32Jul 8, 2020Updated 5 years ago
- Implementation for CVPR 2020 Paper "Two Causal Principles for Improving Visual Dialog"☆31Feb 19, 2023Updated 3 years ago
- TikTok Clone app made with Flutter and Firebase☆17Sep 21, 2023Updated 2 years ago
- ☆10Nov 9, 2020Updated 5 years ago
- Training and evaluation codes for the BertGen paper (ACL-IJCNLP 2021)☆11Sep 17, 2023Updated 2 years ago
- Super Flappy Bird in p5.js☆10Mar 8, 2021Updated 5 years ago
- ☆10Feb 13, 2023Updated 3 years ago
- PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)☆374Jul 29, 2023Updated 2 years ago
- Neural network sequence labeling model - some sloppy modifications to the original toolkit to enable punctuation restoration in unsegment…☆10Jan 8, 2017Updated 9 years ago
- Provides access to NASA's Exoplanet Archive☆14Sep 18, 2022Updated 3 years ago
- Official code for "Rethinking Chain-of-Thought Reasoning for Videos"☆20Dec 14, 2025Updated 2 months ago
- Crawl traffic data from PEMS☆10Jul 19, 2021Updated 4 years ago
- Course review and timetable planning platform used by thousands of CUHK students☆13Aug 19, 2024Updated last year
- Phonetically balanced text to speech sentences☆10Aug 16, 2021Updated 4 years ago
- Compressed ML Math material based on the book "Mathematics for Machine Learning" and other resources.☆10Jan 7, 2020Updated 6 years ago
- CopyNet (Copy Mechanism in Seq2Seq) implementation with TensorFlow 2☆10Nov 21, 2022Updated 3 years ago
- A package for Hangul (korean alphabet)☆13Dec 19, 2022Updated 3 years ago
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Mar 15, 2023Updated 2 years ago
- Image and video processing toolbox☆10Jun 12, 2020Updated 5 years ago
- Simple Online Realtime Tracking with a Deep Association Metric☆11Jul 26, 2017Updated 8 years ago
- Repository for Course 60-212 at CMU☆12Updated this week
- ☆10Jul 25, 2023Updated 2 years ago