microsoft/NeuralSpeech

☆1,461

Alternatives and similar repositories for NeuralSpeech

Users who are interested in NeuralSpeech are comparing it to the repositories listed below.

heatz123 / naturalspeech
A fully working pytorch implementation of NaturalSpeech (Tan et al., 2022)
☆475 · Feb 7, 2024 · Updated 2 years ago
huawei-noah / Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
☆604 · Sep 18, 2023 · Updated 2 years ago
lucidrains / naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
☆1,333 · Sep 24, 2023 · Updated 2 years ago
KevinMIN95 / StyleSpeech
Official implementation of Meta-StyleSpeech and StyleSpeech
☆253 · Feb 9, 2022 · Updated 4 years ago
wenet-e2e / speech-synthesis-paper
List of speech synthesis papers.
☆1,074 · Jul 24, 2023 · Updated 2 years ago
keonlee9420 / Comprehensive-Transformer-TTS
A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration…
☆328 · Sep 24, 2022 · Updated 3 years ago
NVIDIA / BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
☆1,229 · Sep 5, 2024 · Updated last year
wenet-e2e / wetts
Production First and Production Ready End-to-End Text-to-Speech Toolkit
☆416 · Nov 20, 2025 · Updated 7 months ago
NATSpeech / NATSpeech
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and Diff…
☆1,004 · Apr 2, 2023 · Updated 3 years ago
ZhangXInFD / SpeechTokenizer
This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples a…
☆661 · Jun 9, 2024 · Updated 2 years ago
b04901014 / MQTTS
☆260 · May 15, 2023 · Updated 3 years ago
Rongjiehuang / ProDiff
PyTorch Implementation of ProDiff (ACM-MM'22) with a Extremely-Fast diffusion speech synthesis pipeline
☆432 · Apr 19, 2023 · Updated 3 years ago
KdaiP / StableTTS
Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3
☆437 · Sep 13, 2024 · Updated last year
jik876 / hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
☆2,355 · Jul 27, 2024 · Updated last year
jaywalnut310 / glow-tts
A Generative Flow for Text-to-Speech via Monotonic Alignment Search
☆712 · Jul 12, 2022 · Updated 4 years ago
NVIDIA / radtts
Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, …
☆291 · Apr 6, 2023 · Updated 3 years ago
speechio / chinese_text_normalization
Chinese text normalization for speech processing
☆733 · Mar 18, 2023 · Updated 3 years ago
X-LANCE / VoiceFlow-TTS
[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"
☆375 · Sep 3, 2024 · Updated last year
yerfor / SyntaSpeech
SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code
☆201 · Sep 4, 2022 · Updated 3 years ago
yl4579 / PL-BERT
Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions
☆268 · Jan 13, 2025 · Updated last year
chomeyama / SiFiGAN
Official implementation of the source-filter HiFiGAN vocoder
☆274 · Jul 29, 2023 · Updated 2 years ago
gemelo-ai / vocos
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
☆1,144 · Aug 7, 2024 · Updated last year
X-LANCE / StoryTTS
[ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations
☆141 · Apr 27, 2024 · Updated 2 years ago
kan-bayashi / ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
☆1,644 · Apr 22, 2024 · Updated 2 years ago
shivammehta25 / Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
☆1,331 · Updated this week
lifeiteng / vall-e
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
☆2,205 · Sep 10, 2025 · Updated 10 months ago
ming024 / FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
☆2,185 · Oct 27, 2023 · Updated 2 years ago
SungFeng-Huang / Meta-TTS
Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More up-to-date code is in "refactor" branch.
☆193 · Jun 8, 2023 · Updated 3 years ago
tencent-ailab / bddm
BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis
☆238 · Jul 13, 2022 · Updated 4 years ago
thuhcsi / VAENAR-TTS
The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.
☆144 · Jul 8, 2021 · Updated 5 years ago
line / LibriTTS-P
LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning
☆161 · Jun 13, 2024 · Updated 2 years ago
tts-tutorial / survey
A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdf
☆371 · Nov 5, 2021 · Updated 4 years ago
yl4579 / StyleTTS
Official Implementation of StyleTTS
☆465 · Jan 13, 2025 · Updated last year
sp-nitech / diffsptk
A differentiable version of SPTK
☆201 · Updated this week
keonlee9420 / PortaSpeech
PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech
☆342 · Feb 17, 2022 · Updated 4 years ago
liusongxiang / ppg-vc
PPG-Based Voice Conversion
☆349 · Jul 22, 2022 · Updated 3 years ago
yangdongchao / AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
☆674 · Dec 27, 2023 · Updated 2 years ago
Rongjiehuang / FastDiff
PyTorch Implementation of FastDiff (IJCAI'22)
☆423 · Jun 20, 2024 · Updated 2 years ago
Rongjiehuang / GenerSpeech
PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.
☆333 · Feb 9, 2024 · Updated 2 years ago