dialogtekgeek/AVSD-DSTC10_Official

dialogtekgeek / AVSD-DSTC10_Official

Audio Visual Scene-Aware Dialog (AVSD) Challenge at the 10th Dialog System Technology Challenge (DSTC)

☆27

Alternatives and similar repositories for AVSD-DSTC10_Official

Users who are interested in AVSD-DSTC10_Official are comparing it to the repositories listed below.

dialogtekgeek / AudioVisualSceneAwareDialog
☆27 · May 4, 2020 · Updated 6 years ago
salesforce / BiST
Code for the paper BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues (EMNLP20)
☆11 · Jun 16, 2025 · Updated last year
alexa / alexa-with-dstc10-track2-dataset
DSTC10 Track 2 - Knowledge-grounded Task-oriented Dialogue Modeling on Spoken Conversations
☆64 · Jul 25, 2023 · Updated 3 years ago
jaeyun95 / pre-trained-vlk-model
Pre-trained vision and language model summary
☆12 · Apr 20, 2021 · Updated 5 years ago
facebookresearch / simmc2
Code for SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations
☆109 · Nov 12, 2022 · Updated 3 years ago
ictnlp / DSTC8-AVSD
We ranked 1st in the DSTC8 Audio-Visual Scene-Aware Dialog competition. This is the source code for our IEEE/ACM TASLP (AAAI2020-DSTC8-AVSD…
☆56 · Jun 12, 2023 · Updated 3 years ago
hudaAlamri / DSTC7-Audio-Visual-Scene-Aware-Dialog-AVSD-Challenge
☆54 · Nov 18, 2019 · Updated 6 years ago
yixinL7 / Refactoring-Summarization
☆18 · Apr 11, 2021 · Updated 5 years ago
rowanz / merlot_reserve
Code release for "MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound"
☆146 · Jun 1, 2022 · Updated 4 years ago
nervjack2 / Speech2Unit
☆13 · Sep 25, 2024 · Updated last year
lizekang / DSTC10-MOD
DSTC10 Track1 - MOD: Internet Meme Incorporated Open-domain Dialog
☆51 · Feb 16, 2023 · Updated 3 years ago
abc403 / SMCA-replication
SMCA replication
☆21 · Jul 24, 2021 · Updated 5 years ago
zinengtang / Perceiver_VL
PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023)
☆34 · Feb 5, 2023 · Updated 3 years ago
facebookresearch / codraw-models
Models for the Collaborative Drawing (CoDraw) task
☆14 · Jan 15, 2019 · Updated 7 years ago
gaopengcuhk / Container
Official Code Release for Container: Context Aggregation Network
☆46 · Oct 17, 2021 · Updated 4 years ago
henryhungle / MTN
Code for the paper Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems (ACL19)
☆100 · Oct 17, 2022 · Updated 3 years ago
alexa / alexa-with-dstc9-track1-dataset
DSTC9 Track 1 - Beyond Domain APIs: Task-oriented Conversational Modeling with Unstructured Knowledge Access
☆106 · Jun 12, 2023 · Updated 3 years ago
iwhwang / SelecMix
SelecMix: Debiased Learning by Contradicting-pair Sampling (NeurIPS 2022)
☆13 · Jun 5, 2024 · Updated 2 years ago
allenai / container
☆57 · Oct 17, 2021 · Updated 4 years ago
simpleshinobu / visdial-principles
Implementation for CVPR 2020 Paper "Two Causal Principles for Improving Visual Dialog"
☆31 · Feb 19, 2023 · Updated 3 years ago
gicheonkang / prograsp
🦾 PyTorch Implementation for the ICRA'24 Paper, "PROGrasp: Pragmatic Human-Robot Communication for Object Grasping"
☆15 · May 5, 2025 · Updated last year
Adit31 / Captionomaly-Deep-Learning-Toolbox-for-Anomaly-Captioning
Source Code for Captionomaly: A Deep Learning Toolbox for Anomaly Captioning in Surveillance Videos
☆13 · Jun 26, 2023 · Updated 3 years ago
KimHyeonwoo / go-hangul
A package for Hangul (Korean alphabet)
☆13 · Dec 19, 2022 · Updated 3 years ago
MingLunHan / CIF-ColDec
[ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection
☆25 · Jul 14, 2026 · Updated 2 weeks ago
pku-sixing / ACL2020-ConKADI
Resources for our ACL2020 paper, Diverse and Informative Dialogue Generation with Context-Specific Commonsense Knowledge Awareness
☆40 · Nov 8, 2020 · Updated 5 years ago
AlenUbuntu / Awesome-Vision-and-Language-PreTrain-Papers
☆14 · Dec 25, 2020 · Updated 5 years ago
gicheonkang / dan-visdial
✨ Official PyTorch Implementation for EMNLP'19 Paper, "Dual Attention Networks for Visual Reference Resolution in Visual Dialog"
☆44 · Mar 19, 2023 · Updated 3 years ago
FrankFundel / SGCond
☆10 · Jun 28, 2023 · Updated 3 years ago
idansc / mrr-ndcg
☆18 · Jun 10, 2024 · Updated 2 years ago
ablodge / leamr
A structurally comprehensive dataset of AMR-to-text alignments for coverage of a larger variety of linguistic phenomena, for research rel…
☆16 · Dec 10, 2022 · Updated 3 years ago
j-min / VL-T5
PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)
☆372 · Jul 29, 2023 · Updated 3 years ago
RoboCupAtHome / vizbox
A visualization system for RoboCup@Home robots
☆10 · Jul 12, 2019 · Updated 7 years ago
facebookresearch / DVDialogues
Code for DVD: A Diagnostic Dataset for Multi-step Reasoning in Video Grounded Dialogue
☆14 · Oct 12, 2021 · Updated 4 years ago
piekey1994 / IOM
Information-oriented Metric (IOM)
☆11 · Sep 2, 2020 · Updated 5 years ago
hehefan / video-classification
TensorFlow implementation for video classification.
☆44 · Jul 7, 2018 · Updated 8 years ago
JinchaoLove / CUHK-PhD-Thesis-Template
LaTeX template for CUHK PhD Thesis
☆14 · Jun 29, 2025 · Updated last year
cutopia-labs / CUtopia
Course review and timetable planning platform used by thousands of CUHK students
☆13 · Aug 19, 2024 · Updated last year
huangxt39 / BART_on_COVID_dialogue
Fine-tuning BART on COVID Dialogue Dataset
☆17 · Apr 8, 2020 · Updated 6 years ago
LaVi-Lab / Visual-Table
[EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models"
☆20 · Oct 17, 2024 · Updated last year