A collection of plain text dialogue datasets
☆97Aug 13, 2018Updated 7 years ago
Alternatives and similar repositories for Dialogue-Datasets
Users that are interested in Dialogue-Datasets are comparing it to the libraries listed below
Sorting:
- The Self-dialogue Corpus - a collection of self-dialogues across music, movies and sports☆107Mar 19, 2024Updated last year
- A repository linking to publicly available dialog datasets. Feel free to send pull requests.☆69Feb 2, 2022Updated 4 years ago
- CVAE_XGate model in paper "Xu, Dusek, Konstas, Rieser. Better Conversations by Modeling, Filtering, and Optimizing for Coherence and Dive…☆16Jan 23, 2020Updated 6 years ago
- Pytorch implementation of "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions", ICASSP, 2018.☆19Jan 21, 2021Updated 5 years ago
- Portal Tutorial☆11Feb 3, 2018Updated 8 years ago
- Scripts for recreating the Replication Dataset for Fundamental Frequency Estimation. Part of the dissertation "Pitch of Voiced Speech in …☆11Mar 29, 2021Updated 4 years ago
- ☆12Dec 20, 2018Updated 7 years ago
- Emoji-cheat-sheet converter for Python☆10Dec 29, 2014Updated 11 years ago
- ACL2019:Learning to Abstract for Memory-augmented Conversational Response Generation☆19Sep 14, 2019Updated 6 years ago
- Chatbot using Tensorflow (Model is transformer) ko☆30Dec 10, 2018Updated 7 years ago
- ☆11Oct 3, 2021Updated 4 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Perform the forced decoding with target transcription☆11Sep 12, 2018Updated 7 years ago
- 모두의 말뭉치 데이터를 분석에 편리한 형태로 변환하는 기능을 제공합니다.☆11Mar 2, 2022Updated 4 years ago
- ☆12Oct 2, 2020Updated 5 years ago
- ☆26Jan 27, 2018Updated 8 years ago
- 한국어 문서에 노이즈를 추가합니다.☆27Nov 9, 2022Updated 3 years ago
- This repo is containing notes and implementations for cherry-picked publications of my particular interest☆12May 14, 2020Updated 5 years ago
- GPT-jax based on the official huggingface library☆13Jun 22, 2021Updated 4 years ago
- Korean large emotion labeled dataset (EmoNSMC)☆14Mar 5, 2020Updated 5 years ago
- Tensorflow implementation of HRED (Hierarchical Recurrent Encoder-decoder).☆20Jan 29, 2019Updated 7 years ago
- Source code for EMNLP'25 paper "CodeRAG: Finding Relevant and Necessary Knowledge for Retrieval-Augmented Repository-Level Code Completio…☆18Jan 18, 2026Updated last month
- Fast and differentiable hidden Markov model in C++☆19Jan 20, 2023Updated 3 years ago
- Filter dialog data with a simple entropy-based method (see ACL paper)☆14Oct 4, 2019Updated 6 years ago
- 나무위키, 위키피디아, 다음블로그, 티스토리, 유튜브, 네이트판 크롤러☆12Feb 20, 2026Updated last week
- lyrics-to-audio-alignement system. Initially done using HTK for rapid prototyping☆14Mar 14, 2018Updated 7 years ago
- ☆14Jul 8, 2018Updated 7 years ago
- Grounded conversational dataset for end-to-end conversational AI (official DSTC7 data)☆175Aug 20, 2024Updated last year
- Wave2vec 2.0 Recognize pipeline☆33Dec 22, 2020Updated 5 years ago
- ☆15Nov 30, 2020Updated 5 years ago
- Scripts for computing common lyrics-to-audio alignment evaluation metrics. Usable evaluation for any token-based alignment (e.g. if tok…☆18Oct 27, 2020Updated 5 years ago
- Code for the paper - Controlling Dialogue Generation with Semantic Exemplars (Naacl 2021) A semantic exemplar based retrieve-refine appro…☆18Mar 26, 2021Updated 4 years ago
- Dialogue corpus creation and evaluation scripts for the Ubuntu Dialogue Corpus.☆15Jun 9, 2023Updated 2 years ago
- Convert Numerical Representations to Korean Pronunciation☆14Apr 20, 2020Updated 5 years ago
- PyTorch implementation for Interpretable Dialog Generation ACL 2018, It is released by Tiancheng Zhao (Tony) from Dialog Research Center…☆197Jan 14, 2019Updated 7 years ago
- A collection of all my datasets☆238Jun 7, 2018Updated 7 years ago
- A Python package for audio annotation and classifier training. Developed in collaboration with the WGBH Foundation and the American Archi…☆17Jun 2, 2018Updated 7 years ago
- A recipe for creating a Speaker Identification system built on Kaldi.☆15Jan 2, 2020Updated 6 years ago
- Set of guides and references for annotating NLP data☆16Apr 27, 2022Updated 3 years ago