Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.
☆229Aug 17, 2020Updated 5 years ago
Alternatives and similar repositories for Crystal
Users that are interested in Crystal are comparing it to the libraries listed below
Sorting:
- Crystal TTVS engine is a real-time audio-visual Multilingual speech synthesizer with a 3D expressive avatar.☆87Aug 17, 2020Updated 5 years ago
- A Demo of Mandarin/Chinese TTS frontend☆285Apr 18, 2022Updated 3 years ago
- Chinese text normalization for speech processing☆721Mar 18, 2023Updated 2 years ago
- Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.☆157Jul 2, 2021Updated 4 years ago
- A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset☆361Dec 24, 2021Updated 4 years ago
- Chinese Text Normalization and Dataset☆91May 14, 2022Updated 3 years ago
- TTS-frontend with Bert and CRF/lstm (For Tacotron)☆53Jun 2, 2020Updated 5 years ago
- Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.☆184Aug 12, 2020Updated 5 years ago
- ☆69Mar 31, 2021Updated 4 years ago
- The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.☆144Jul 8, 2021Updated 4 years ago
- RepVgg + HiFiGAN☆36Aug 10, 2022Updated 3 years ago
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation☆80Feb 24, 2021Updated 5 years ago
- Official PyTorch implementation of Speaker Conditional WaveRNN☆110Jun 22, 2022Updated 3 years ago
- TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization☆103Feb 5, 2024Updated 2 years ago
- ☆45Dec 16, 2019Updated 6 years ago
- Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS☆168Apr 10, 2024Updated last year
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code☆203Sep 4, 2022Updated 3 years ago
- PPG-Based Voice Conversion☆348Jul 22, 2022Updated 3 years ago
- A Generative Flow for Text-to-Speech via Monotonic Alignment Search☆702Jul 12, 2022Updated 3 years ago
- Efficient neural speech synthesis☆81Nov 25, 2020Updated 5 years ago
- ☆45Oct 24, 2020Updated 5 years ago
- Chinese Prosodic Structure Prediction☆10May 18, 2019Updated 6 years ago
- Adaptive Vocoder for Custom Voice☆61Sep 22, 2022Updated 3 years ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text☆247Oct 30, 2019Updated 6 years ago
- ☆197May 3, 2024Updated last year
- Predict prosody labels for Chinese sentences.☆41Jul 7, 2022Updated 3 years ago
- Efficient neural speech synthesis☆1,203Sep 21, 2024Updated last year
- Encoder and Decoder and Attention Based Prosody Prediction☆68Jan 17, 2018Updated 8 years ago
- Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.☆71Mar 19, 2021Updated 4 years ago
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…☆74Sep 21, 2022Updated 3 years ago
- ☆46Apr 16, 2023Updated 2 years ago
- speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…☆410Apr 8, 2020Updated 5 years ago
- Implementation of the AlignTTS☆77Jul 6, 2023Updated 2 years ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling☆191Nov 18, 2021Updated 4 years ago
- Official implementation of BVAE-TTS☆173Sep 26, 2022Updated 3 years ago
- ☆25Mar 12, 2022Updated 3 years ago
- Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"☆116Dec 22, 2021Updated 4 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆71Dec 2, 2022Updated 3 years ago
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆238Nov 14, 2020Updated 5 years ago