thuhcsi/mm2022-conversational-tts

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/thuhcsi/mm2022-conversational-tts)

thuhcsi / mm2022-conversational-tts

☆11

Alternatives and similar repositories for mm2022-conversational-tts

Users that are interested in mm2022-conversational-tts are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

walker-hyf / FCTalker
View on GitHub
FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)
☆26Feb 22, 2024Updated 2 years ago
circle-hit / MuCDN
View on GitHub
Code for COLING 2022 accepted paper titled "MuCDN: Mutual Conversational Detachment Network for Emotion Recognition in Multi-Party Conver…
☆10Jul 21, 2023Updated 3 years ago
bronichern / DeepFry
View on GitHub
☆13Jun 29, 2025Updated last year
keonlee9420 / Daft-Exprt
View on GitHub
PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
☆55Oct 15, 2021Updated 4 years ago
NeuroWave-ai / CUCVAE-TTS
View on GitHub
☆25Mar 12, 2022Updated 4 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
walker-hyf / ECSS
View on GitHub
Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling (Accepted by AAAI'2024)
☆59Jun 20, 2024Updated 2 years ago
light1726 / SpeechTripleNet
View on GitHub
The implementation of paper "SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody"
☆33Nov 23, 2023Updated 2 years ago
hs-oh-prml / EmotionControllableTextToSpeech
View on GitHub
☆21Jun 16, 2021Updated 5 years ago
thuhcsi / english-conversation-corpus
View on GitHub
English conversation corpus for conversational TTS.
☆21Mar 13, 2023Updated 3 years ago
yujxx / PodEval
View on GitHub
A comprehensive toolkit for podcast evaluation. https://arxiv.org/abs/2510.00485
☆21Dec 9, 2025Updated 7 months ago
BridgetteSong / BunchedLPCnet
View on GitHub
This repository provides UNOFFICIAL Bunched LPCNet implementations with Pytorch.
☆14Jun 17, 2021Updated 5 years ago
XingBowen714 / DARER
View on GitHub
PyTorch source Code for our ACL 2022 paper: DARER: Dual-task Temporal Relational Recurrent Reasoning Network for Joint Dialog Sentiment C…
☆15Mar 30, 2023Updated 3 years ago
AlexK-PL / GST_Tacotron2
View on GitHub
A NVIDIA's Pytorch Tacotron2 adaptation with unsupervised Global Style Tokens. The model has been trained with the English read-speech LJ…
☆10Sep 4, 2023Updated 2 years ago
sileod / DiscSense
View on GitHub
Automated Semantic Analysis of Discourse Markers
☆11May 30, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
pzelasko / daseg
View on GitHub
Dialog Acts SEGmentation: Tools for dialog act research
☆14Mar 21, 2025Updated last year
ddlBoJack / MT4SSL
View on GitHub
[INTERSPEECH 2023 Best Paper Shortlist] Official implementation for MT4SSL: Boosting Self-Supervised Speech Representation Learning by In…
☆45Mar 25, 2024Updated 2 years ago
ttslr / MonTTS
View on GitHub
☆16Dec 23, 2021Updated 4 years ago
line / promptttspp
View on GitHub
PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions
☆86Oct 11, 2024Updated last year
MU94W / TTS-Eval
View on GitHub
☆18Aug 9, 2018Updated 7 years ago
HuiyuanXie / tiage
View on GitHub
☆19Jul 20, 2022Updated 4 years ago
KathyReid / cvaccents
View on GitHub
A set of tools for working with accent data in Mozilla's Common Voice dataset
☆14Nov 3, 2023Updated 2 years ago
Takaaki-Saeki / simplified_neural_source_filter
View on GitHub
PyTorch implementation of simplified neural source filter model (s-nsf)
☆14Aug 4, 2021Updated 4 years ago
rhoposit / icassp2021
View on GitHub
☆15May 8, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ajinkyakulkarni14 / ERISHA
View on GitHub
ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…
☆44Dec 17, 2020Updated 5 years ago
francislata / unicats
View on GitHub
An unofficial implementation of "UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding".
☆26Nov 4, 2023Updated 2 years ago
Labmem-Zhouyx / CDFSE_FastSpeech2
View on GitHub
The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…
☆86Dec 20, 2022Updated 3 years ago
ritheshkumar95 / minimal_diffusion_models
View on GitHub
☆16Dec 31, 2021Updated 4 years ago
rhoposit / multilingual_VQVAE
View on GitHub
☆37May 8, 2021Updated 5 years ago
cpdu / unicats
View on GitHub
☆63Jan 15, 2024Updated 2 years ago
revsic / torch-diffusion-wavegan
View on GitHub
Parallel waveform generation with DiffusionGAN
☆17Mar 26, 2022Updated 4 years ago
thuhcsi / DiffVar
View on GitHub
☆30Aug 12, 2023Updated 2 years ago
ivanvovk / compressed-tacotron2-pytorch
View on GitHub
Compressed version of Tacotron 2 using Tensor Train + Waveglow.
☆22Dec 26, 2019Updated 6 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Coder-jzq / RADKA-CSS
View on GitHub
☆17Mar 25, 2025Updated last year
NamSahng / SingingStyleTransfer
View on GitHub
Singing Style Transfer using Deep U-net for vocal separation & CycleConsistencyBoundaryEquilibrium GAN(Cycle-BEGAN) for vocal style trans…
☆34Sep 17, 2019Updated 6 years ago
xcmyz / ConvTasNet4BasisMelGAN
View on GitHub
This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.
☆21Jul 21, 2021Updated 5 years ago
keonlee9420 / DailyTalk
View on GitHub
Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023
☆260Jun 5, 2025Updated last year
KinglittleQ / GST-Tacotron
View on GitHub
A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
☆374Dec 8, 2022Updated 3 years ago
wenet-e2e / wecut
View on GitHub
video cut powered by AI
☆23Nov 15, 2022Updated 3 years ago
zzw922cn / wesinger2
View on GitHub
Synthesized singing voice demos of WeSinger 2 paper.
☆26Feb 20, 2023Updated 3 years ago