PyTorch implementation of DNN-HSMM for TTS
☆71Mar 14, 2021Updated 5 years ago
Alternatives and similar repositories for DNN-HSMM
Users that are interested in DNN-HSMM are comparing it to the libraries listed below.
- This is an implementation of "Generative adversarial network-based postfilter for statistical parametric speech synthesis"☆16Jun 27, 2018Updated 7 years ago
- GlottDNN vocoder and tools for training DNN excitation models☆33Feb 27, 2021Updated 5 years ago
- ☆69Mar 31, 2021Updated 5 years ago
- MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.☆80Oct 14, 2019Updated 6 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Oct 28, 2019Updated 6 years ago
- Based on https://github.com/fatchord/WaveRNN☆24May 3, 2020Updated 6 years ago
- Official implementation of BVAE-TTS☆173Sep 26, 2022Updated 3 years ago
- Using the WORLD vocoder to extract features and prepare data for training neural networks☆11Oct 9, 2017Updated 8 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Aug 31, 2020Updated 5 years ago
- Neural HMMs are all you need (for high-quality attention-free TTS)☆165Apr 27, 2026Updated last week
- ☆199May 3, 2024Updated 2 years ago
- ☆45Dec 16, 2019Updated 6 years ago
- ☆26Apr 21, 2021Updated 5 years ago
- Google's TPGST reimplementation.☆34Dec 11, 2019Updated 6 years ago
- A fork of sinsy: HMM/DNN-based singing voice synthesis system☆74Feb 6, 2022Updated 4 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Jul 16, 2020Updated 5 years ago
- Charsiu: A neural phonetic aligner.☆341Sep 19, 2022Updated 3 years ago
- The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.☆144Jul 8, 2021Updated 4 years ago
- ASR, End-to-End, end2end, Speech Recognition, end-to-end speech recognition☆12Oct 25, 2020Updated 5 years ago
- Code for ICASSP 2019 paper☆18Oct 29, 2018Updated 7 years ago
- ☆10Dec 10, 2021Updated 4 years ago
- WaveNet implementation using tf.estimator☆21Jul 6, 2023Updated 2 years ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆43May 9, 2023Updated 2 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- Synthesized singing voice demos of WeSinger 2 paper.☆26Feb 20, 2023Updated 3 years ago
- Implementation of the AlignTTS☆77Jul 6, 2023Updated 2 years ago
- A vocoder framework that has been widely used in the research community since 1999.☆186Dec 24, 2018Updated 7 years ago
- Speech recognition based on a deep neural network/hidden Markov model☆10Jun 3, 2020Updated 5 years ago
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆14Feb 5, 2025Updated last year
- Read and write HTK and HTS files from python.☆20Mar 17, 2015Updated 11 years ago
- Yin pitch estimator in PyTorch☆118Nov 7, 2022Updated 3 years ago
- RawNet: Fast End-to-End Neural Vocoder☆42May 29, 2019Updated 6 years ago
- context labels and pronunciation data for JSUT corpus☆77Sep 2, 2021Updated 4 years ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling☆191Nov 18, 2021Updated 4 years ago
- A library for hidden semi-Markov models with explicit durations☆88Aug 21, 2021Updated 4 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Sep 10, 2021Updated 4 years ago
- Gaussian Mixture VAE Tacotron☆54Jul 6, 2023Updated 2 years ago
- Voice conversion tools for STRAIGHT☆29Jul 17, 2020Updated 5 years ago
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"☆169Jul 6, 2023Updated 2 years ago