Audio Visual Scene-Aware Dialog (AVSD) Challenge at the 10th Dialog System Technology Challenge (DSTC)
☆27Aug 19, 2022Updated 3 years ago
Alternatives and similar repositories for AVSD-DSTC10_Official
Users that are interested in AVSD-DSTC10_Official are comparing it to the libraries listed below
Sorting:
- ☆27May 4, 2020Updated 5 years ago
- We rank the 1st in DSTC8 Audio-Visual Scene-Aware Dialog competition. This is the source code for our IEEE/ACM TASLP (AAAI2020-DSTC8-AVSD…☆56Jun 12, 2023Updated 2 years ago
- ☆54Nov 18, 2019Updated 6 years ago
- ☆13Sep 25, 2024Updated last year
- Code for SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations☆106Nov 12, 2022Updated 3 years ago
- pre-trained vision and language model summary☆12Apr 20, 2021Updated 4 years ago
- K-Means algorithm in the Poincare Disk Model☆15Nov 12, 2018Updated 7 years ago
- ☆16May 6, 2021Updated 4 years ago
- [ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection☆25May 18, 2023Updated 2 years ago
- DSTC10 Track1 - MOD: Internet Meme Incorporated Open-domain Dialog☆50Feb 16, 2023Updated 3 years ago
- Online (real-time) decoder to be used with DeepSpeech2 model☆25Feb 27, 2020Updated 6 years ago
- Code release for "MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound"☆146Jun 1, 2022Updated 3 years ago
- Code for the paper Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems (ACL19)☆100Oct 17, 2022Updated 3 years ago
- DSTC9 Track 1 - Beyond Domain APIs: Task-oriented Conversational Modeling with Unstructured Knowledge Access☆105Jun 12, 2023Updated 2 years ago
- PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023)☆34Feb 5, 2023Updated 3 years ago
- ☆31Jun 19, 2020Updated 5 years ago
- ACL 2020 Website☆32Jul 8, 2020Updated 5 years ago
- Code for ACL 2020 paper "Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA." Hyounghun Kim, Zineng T…☆34May 14, 2020Updated 5 years ago
- Implementation for CVPR 2020 Paper "Two Causal Principles for Improving Visual Dialog"☆31Feb 19, 2023Updated 3 years ago
- Training and evaluation codes for the BertGen paper (ACL-IJCNLP 2021)☆11Sep 17, 2023Updated 2 years ago
- TikTok Clone app made with Flutter and Firebase☆17Sep 21, 2023Updated 2 years ago
- practical guides, tutorials, and code samples for ml4a☆10Mar 26, 2019Updated 6 years ago
- ☆10Jun 28, 2023Updated 2 years ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆12Aug 1, 2025Updated 7 months ago
- Super Flappy Bird in p5.js☆10Mar 8, 2021Updated 5 years ago
- Information-oriented Metric (IOM)☆11Sep 2, 2020Updated 5 years ago
- Resources for our IJCAI 2020 paper, TopicKA: Generating Commonsense Knowledge-Aware Dialogue Responses Towards the Recommended Topic Fact☆12Nov 30, 2020Updated 5 years ago
- Source Code for Captionomaly: A Deep Learning Toolbox for Anomaly Captioning in Surveillance Videos☆13Jun 26, 2023Updated 2 years ago
- ☆10Nov 9, 2020Updated 5 years ago
- PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)☆374Jul 29, 2023Updated 2 years ago
- This repo contains the code for paper "nuCarla: A nuScenes-Style Bird’s-Eye View Perception Dataset for CARLA Simulation"☆31Jan 2, 2026Updated 2 months ago
- Image and video processing toolbox☆10Jun 12, 2020Updated 5 years ago
- Bookdown Tutorial for R Package Validation Framework☆11Nov 5, 2021Updated 4 years ago
- ☆12Apr 26, 2021Updated 4 years ago
- Repository for Course 60-212 at CMU☆12Updated this week
- Repository for SoMeLVLM: A Large Vision Language Model for Social Media Processing☆13Oct 9, 2025Updated 5 months ago
- Official code for "Rethinking Chain-of-Thought Reasoning for Videos"☆20Dec 14, 2025Updated 2 months ago
- Marching Squares implementation for Processing based on https://github.com/murphydactyl/JavaKinectFingerTracker/☆13Sep 10, 2012Updated 13 years ago
- ☆10Jan 3, 2023Updated 3 years ago