espnet / espnet_tts_frontend
Text frontend for ESPnet tts recipes
☆31Updated 3 years ago
Alternatives and similar repositories for espnet_tts_frontend:
Users that are interested in espnet_tts_frontend are comparing it to the libraries listed below
- Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet☆61Updated 3 years ago
- Python wrapper for kaldi's arpa2fst☆38Updated 3 months ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Updated last year
- C++ implementation of End to End TTS which combines both Tacatron2 and LPCNET Vocoder.☆32Updated 5 years ago
- CMU multilingual speech repository☆31Updated 2 years ago
- Pronunciation-assisted Subword Modeling☆29Updated 5 years ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆50Updated 8 months ago
- it's ASR decoder and make graph project☆32Updated 2 years ago
- multilingual speech aligner☆72Updated last year
- PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis☆41Updated 3 years ago
- streaming attention networks for end-to-end automatic speech recognition☆55Updated 4 years ago
- ☆42Updated 4 years ago
- An unofficial implementation of https://arxiv.org/abs/2005.05106☆46Updated 4 years ago
- Implementation of the subscale framework from the WaveRNN paper, building on top of Fatchord's WaveRNN repo☆17Updated 4 years ago
- Implementation of the AlignTTS☆76Updated last year
- A system works on singing voice synthesis☆79Updated 2 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆42Updated 4 years ago
- ☆22Updated 5 years ago
- Simulation of parallel synthesis with LPCNet vocoder☆14Updated 4 years ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"☆17Updated 3 years ago
- Tacotron2 with Global Style Tokens☆64Updated 5 years ago
- ☆31Updated last year
- ☆76Updated 3 years ago
- Neural network-based forced alignment with bidirectional attention mechanism☆73Updated 2 months ago
- ☆33Updated 3 years ago
- A PyTorch implementation of the universal neural vocoder☆67Updated 4 years ago
- Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.☆68Updated 4 years ago
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2(+Conformer) and HiFi-GAN for End to End Text to Speech☆46Updated 2 years ago
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆35Updated 5 years ago
- Open Source Speech/Text Data on AI☆18Updated 2 years ago