pytorch implementation of DNN-HSMM for TTS
☆71Mar 14, 2021Updated 5 years ago
Alternatives and similar repositories for DNN-HSMM
Users that are interested in DNN-HSMM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is an implementation of "Generative adversarial network-based postfilter for statistical parametric speech synthesis"☆16Jun 27, 2018Updated 7 years ago
- GlottDNN vocoder and tools for training DNN excitation models☆33Feb 27, 2021Updated 5 years ago
- ☆69Mar 31, 2021Updated 5 years ago
- MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.☆80Oct 14, 2019Updated 6 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Oct 28, 2019Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Based on https://github.com/fatchord/WaveRNN☆24May 3, 2020Updated 6 years ago
- Official implementation of BVAE-TTS☆173Sep 26, 2022Updated 3 years ago
- using world vocoder to extract features and make data for training neural networks☆11Oct 9, 2017Updated 8 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Aug 31, 2020Updated 5 years ago
- Neural HMMs are all you need (for high-quality attention-free TTS)☆165Apr 27, 2026Updated 3 weeks ago
- ☆199May 3, 2024Updated 2 years ago
- ☆45Dec 16, 2019Updated 6 years ago
- ☆26Apr 21, 2021Updated 5 years ago
- Google's TPGST reimplementation.☆34Dec 11, 2019Updated 6 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A fork of sinsy: HMM/DNN-based singing voice synthesis system☆74Feb 6, 2022Updated 4 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Jul 16, 2020Updated 5 years ago
- Charsiu: A neural phonetic aligner.☆343Sep 19, 2022Updated 3 years ago
- The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.☆144Jul 8, 2021Updated 4 years ago
- ASR, End-to-End, end2end, Speech Recognition, 端到端语音识别☆12Oct 25, 2020Updated 5 years ago
- Code for ICASSP 2019 paper☆18Oct 29, 2018Updated 7 years ago
- ☆10Dec 10, 2021Updated 4 years ago
- WaveNet implementation using tf.estimator☆21Jul 6, 2023Updated 2 years ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆43May 9, 2023Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆12Jul 6, 2023Updated 2 years ago
- Synthesized singing voice demos of WeSinger 2 paper.☆26Feb 20, 2023Updated 3 years ago
- Implementation of the AlignTTS☆77Jul 6, 2023Updated 2 years ago
- A vocoder framework which had been widely used in research community since 1999.☆186Dec 24, 2018Updated 7 years ago
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆14Feb 5, 2025Updated last year
- Read and write HTK and HTS files from python.☆20Mar 17, 2015Updated 11 years ago
- Yin pitch estimator in PyTorch☆118Nov 7, 2022Updated 3 years ago
- RawNet: Fast End-to-End Neural Vocoder☆43May 29, 2019Updated 6 years ago
- context labels and pronunciation data for JSUT corpus☆77Sep 2, 2021Updated 4 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling☆191Nov 18, 2021Updated 4 years ago
- A library for hidden semi-Markov models with explicit durations☆88Aug 21, 2021Updated 4 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Sep 10, 2021Updated 4 years ago
- Gaussian Mixture VAE Tacotron☆54Jul 6, 2023Updated 2 years ago
- Voice conversion tools for STRAIGHT☆29Jul 17, 2020Updated 5 years ago
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"☆169Jul 6, 2023Updated 2 years ago
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.☆48Sep 4, 2023Updated 2 years ago