nwpuaslp/TTS_Course

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/nwpuaslp/TTS_Course)

nwpuaslp / TTS_Course

☆70

Alternatives and similar repositories for TTS_Course

Users that are interested in TTS_Course are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

azraelkuan / repgan
View on GitHub
RepVgg + HiFiGAN
☆36Aug 10, 2022Updated 3 years ago
li1jkdaw / LPCNet_parallel
View on GitHub
Simulation of parallel synthesis with LPCNet vocoder
☆14May 5, 2020Updated 6 years ago
nwpuaslp / ASR_Course
View on GitHub
☆149Aug 2, 2020Updated 5 years ago
xcmyz / FastVocoder
View on GitHub
Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.
☆157Jul 2, 2021Updated 5 years ago
entn-at / DurIAN-1
View on GitHub
Implementation of "DurIAN: Duration Informed Attention Network For Multimodal Synthesis".
☆15Jul 6, 2020Updated 6 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
yerfor / SyntaSpeech
View on GitHub
SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code
☆201Sep 4, 2022Updated 3 years ago
xinshengwang / ICASSP2021_paper_list-VC
View on GitHub
ICASSP 2021 accepted papers in term of voice conversion (VC)
☆18Apr 11, 2021Updated 5 years ago
MingjieChen / EasyVC
View on GitHub
A toolkit for any-to-any encoder-decoder voice conversion systems
☆83Aug 10, 2023Updated 2 years ago
speechio / chinese_text_normalization
View on GitHub
Chinese text normalization for speech processing
☆733Mar 18, 2023Updated 3 years ago
kakaobrain / g2pm
View on GitHub
A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset
☆367Dec 24, 2021Updated 4 years ago
thuhcsi / FlatTN
View on GitHub
Chinese Text Normalization and Dataset
☆91May 14, 2022Updated 4 years ago
lmxue / NVV-SuperBench
View on GitHub
NVV-SuperBench: Beyond Words, Beyond Quality—Benchmarking Nonverbal Vocalizations in Speech Generation (Interspeech 2026 long paper)
☆18Jun 21, 2026Updated last month
cpuimage / Tacotron-2
View on GitHub
Tensorflow implementation of DeepMind's Tacotron-2 (without wavenet)
☆11Jul 12, 2019Updated 7 years ago
MlWoo / sentence2pinyin
View on GitHub
tts fronted-end
☆11Dec 19, 2018Updated 7 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
liusongxiang / efficient_tts
View on GitHub
Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"
☆116Dec 22, 2021Updated 4 years ago
yanggeng1995 / Multi-band-WaveRNN
View on GitHub
☆45Dec 16, 2019Updated 6 years ago
candlewill / CNTN
View on GitHub
ChiNese Text Normalization (CNTN) tool for Text-to-speech system
☆37Apr 12, 2018Updated 8 years ago
thuhcsi / Crystal
View on GitHub
Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.
☆230Aug 17, 2020Updated 5 years ago
X-LANCE / StoryTTS
View on GitHub
[ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations
☆141Apr 27, 2024Updated 2 years ago
tts-tutorial / book
View on GitHub
☆64Sep 18, 2022Updated 3 years ago
wenet-e2e / opencpop
View on GitHub
Opencpop: A High-Quality Open Source Chinese Popular Song Database for Singing Voice Synthesis
☆236Dec 10, 2025Updated 7 months ago
lmxue / ICASSP2022_TTS_VC_Summary
View on GitHub
ICASSP2022 TTS&VC Summary
☆13Jun 9, 2022Updated 4 years ago
lucasjinreal / textfrontend
View on GitHub
单独维护的中文TTS
☆34Oct 28, 2022Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
fss1t / CausalStarGANv2-VC
View on GitHub
☆22Apr 4, 2023Updated 3 years ago
Tencent / SongBench
View on GitHub
☆50Apr 30, 2026Updated 2 months ago
sos1sos2Sixteen / aishell-3-baseline-fc
View on GitHub
The code for aishell-3 baseline acoustic model
☆70Nov 30, 2020Updated 5 years ago
rishikksh20 / vae_tacotron2
View on GitHub
VAE Tacotron 2, an alternative of GST Tacotron
☆91Jul 6, 2023Updated 3 years ago
guanlongzhao / fac-via-ppg
View on GitHub
Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)
☆147Jul 6, 2023Updated 3 years ago
zjwang21 / mix-phoneme-bert
View on GitHub
An unofficial PyTorch implementation of Mix-Phoneme-Bert
☆40Jul 10, 2023Updated 3 years ago
shang0712 / HierTTS
View on GitHub
☆47Apr 16, 2023Updated 3 years ago
ivanvovk / durian-pytorch
View on GitHub
Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.
☆184Aug 12, 2020Updated 5 years ago
huawei-noah / Speech-Backbones
View on GitHub
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
☆604Sep 18, 2023Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
makerjackie / tts-frontend-dataset
View on GitHub
TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization
☆104Feb 5, 2024Updated 2 years ago
b04901014 / MQTTS
View on GitHub
☆260May 15, 2023Updated 3 years ago
wenet-e2e / speech-synthesis-paper
View on GitHub
List of speech synthesis papers.
☆1,074Jul 24, 2023Updated 2 years ago
MasayaKawamura / MB-iSTFT-VITS
View on GitHub
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform
☆469Nov 17, 2022Updated 3 years ago
anonymous-pits / pits
View on GitHub
PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor
☆280Jul 16, 2023Updated 3 years ago
LeoniusChen / Attentions-in-Tacotron
View on GitHub
☆69Mar 31, 2021Updated 5 years ago
BoragoCode / AttentionBasedProsodyPrediction
View on GitHub
Encoder and Decoder and Attention Based Prosody Prediction
☆68Jan 17, 2018Updated 8 years ago