pytorch implementation of DNN-HSMM for TTS
☆71Mar 14, 2021Updated 5 years ago
Alternatives and similar repositories for DNN-HSMM
Users that are interested in DNN-HSMM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is an implementation of "Generative adversarial network-based postfilter for statistical parametric speech synthesis"☆16Jun 27, 2018Updated 7 years ago
- GlottDNN vocoder and tools for training DNN excitation models☆33Feb 27, 2021Updated 5 years ago
- ☆69Mar 31, 2021Updated 5 years ago
- MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.☆80Oct 14, 2019Updated 6 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Oct 28, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Based on https://github.com/fatchord/WaveRNN☆24May 3, 2020Updated 5 years ago
- Official implementation of BVAE-TTS☆173Sep 26, 2022Updated 3 years ago
- using world vocoder to extract features and make data for training neural networks☆11Oct 9, 2017Updated 8 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Aug 31, 2020Updated 5 years ago
- Neural HMMs are all you need (for high-quality attention-free TTS)☆165Mar 16, 2026Updated 3 weeks ago
- ☆198May 3, 2024Updated last year
- ☆45Dec 16, 2019Updated 6 years ago
- ☆26Apr 21, 2021Updated 4 years ago
- Google's TPGST reimplementation.☆34Dec 11, 2019Updated 6 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A fork of sinsy: HMM/DNN-based singing voice synthesis system☆73Feb 6, 2022Updated 4 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Jul 16, 2020Updated 5 years ago
- Charsiu: A neural phonetic aligner.☆339Sep 19, 2022Updated 3 years ago
- The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.☆144Jul 8, 2021Updated 4 years ago
- ASR, End-to-End, end2end, Speech Recognition, 端到端语音识别☆12Oct 25, 2020Updated 5 years ago
- Code for ICASSP 2019 paper☆18Oct 29, 2018Updated 7 years ago
- ☆10Dec 10, 2021Updated 4 years ago
- WaveNet implementation using tf.estimator☆21Jul 6, 2023Updated 2 years ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆43May 9, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆12Jul 6, 2023Updated 2 years ago
- Synthesized singing voice demos of WeSinger 2 paper.☆26Feb 20, 2023Updated 3 years ago
- Implementation of the AlignTTS☆77Jul 6, 2023Updated 2 years ago
- A vocoder framework which had been widely used in research community since 1999.☆186Dec 24, 2018Updated 7 years ago
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆14Feb 5, 2025Updated last year
- Read and write HTK and HTS files from python.☆20Mar 17, 2015Updated 11 years ago
- Yin pitch estimator in PyTorch☆117Nov 7, 2022Updated 3 years ago
- RawNet: Fast End-to-End Neural Vocoder☆42May 29, 2019Updated 6 years ago
- context labels and pronunciation data for JSUT corpus☆77Sep 2, 2021Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A library for hidden semi-Markov models with explicit durations☆87Aug 21, 2021Updated 4 years ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling☆191Nov 18, 2021Updated 4 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Sep 10, 2021Updated 4 years ago
- Gaussian Mixture VAE Tacotron☆54Jul 6, 2023Updated 2 years ago
- Voice conversion tools for STRAIGHT☆29Jul 17, 2020Updated 5 years ago
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"☆169Jul 6, 2023Updated 2 years ago
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.☆48Sep 4, 2023Updated 2 years ago