rgzn-aiyun / tacotron2-melgan
Mel spectrum based on tacotron2 for melgan speech synthesis
☆15Updated 2 years ago
Alternatives and similar repositories for tacotron2-melgan
Users that are interested in tacotron2-melgan are comparing it to the libraries listed below
Sorting:
- ChiNese Text Normalization (CNTN) tool for Text-to-speech system☆35Updated 7 years ago
- Tensorflow Implementation of WaveGlow☆37Updated 5 years ago
- EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System☆15Updated 6 years ago
- Google's TPGST reimplementation.☆34Updated 5 years ago
- ICASSP 2021 accepted papers in term of voice conversion (VC)☆18Updated 4 years ago
- magicspeech competition recipe☆18Updated 4 years ago
- Simulation of parallel synthesis with LPCNet vocoder☆14Updated 5 years ago
- Open Source Speech/Text Data on AI☆18Updated 2 years ago
- ☆31Updated 6 years ago
- ☆45Updated 5 years ago
- ☆20Updated 5 years ago
- 一个开源的中文歌声合成数据集。An open-source Chinese singing synthesizing dataset.☆21Updated 5 years ago
- ☆34Updated 5 years ago
- Real-time melgan based on cpu !!!☆13Updated 5 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Updated 4 years ago
- Tensorflow and kaldi implementation of our paper "VAE-based regularization for deep speaker embedding"☆11Updated 2 years ago
- using world vocoder to extract features and make data for training neural networks☆11Updated 7 years ago
- C++ implementation of End to End TTS which combines both Tacatron2 and LPCNET Vocoder.☆32Updated 5 years ago
- MultiSpeaker Tacotron2 using LifeLong Learning.☆13Updated 5 years ago
- This is the code of the ICASSP 2020 paper "Joint phoneme alignment and text-informed speech separation on highly corrupted speech"☆15Updated last year
- ☆25Updated 3 years ago
- A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)☆17Updated 7 years ago
- ☆22Updated 5 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Updated 4 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆45Updated 3 years ago
- speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…☆15Updated 6 years ago
- This repository provides UNOFFICIAL Bunched LPCNet implementations with Pytorch.☆14Updated 3 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆39Updated 2 years ago
- tts fronted-end☆11Updated 6 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Updated 5 years ago