entn-at / DurIAN-1View external linksLinks
Implementation of "DurIAN: Duration Informed Attention Network For Multimodal Synthesis".
☆14Jul 6, 2020Updated 5 years ago
Alternatives and similar repositories for DurIAN-1
Users that are interested in DurIAN-1 are comparing it to the libraries listed below
Sorting:
- An unofficial implement of autoregressive vocoder Multiband-WaveRNN. Audio samples in https://rongjiehuang.github.io/Multiband-WaveRNN/☆28Feb 12, 2021Updated 5 years ago
- Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.☆184Aug 12, 2020Updated 5 years ago
- ☆70Nov 30, 2020Updated 5 years ago
- TTS Text Analyzer☆32Jul 20, 2023Updated 2 years ago
- Tensorflow implementation of DeepMind's Tacotron-2 (without wavenet)☆11Jul 12, 2019Updated 6 years ago
- Voice conversion model for real-time speech synthesis using PPG (Phonetic PosteriorGram) as an intermediate feature, written in Pytorch.☆28Mar 3, 2022Updated 3 years ago
- ☆16Apr 4, 2022Updated 3 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆14Dec 30, 2023Updated 2 years ago
- ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Dec 17, 2020Updated 5 years ago
- ☆37May 8, 2021Updated 4 years ago
- Simulation of parallel synthesis with LPCNet vocoder☆14May 5, 2020Updated 5 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆40Oct 22, 2022Updated 3 years ago
- 60k hours of phoneme-aligned audio from audio books☆19Jul 27, 2024Updated last year
- Model Fusion Based Prosody Prediction☆17Mar 18, 2018Updated 7 years ago
- Mutiband version of HIFIGAN☆19Nov 6, 2020Updated 5 years ago
- RawNet: Fast End-to-End Neural Vocoder☆42May 29, 2019Updated 6 years ago
- torch version of LPCNet☆22Jul 8, 2020Updated 5 years ago
- Text to Speech Synthesis based on controllable latent representation☆14Aug 30, 2019Updated 6 years ago
- Inference code for Audiodec-Valle-Wenetspeech4TTS☆50Jul 14, 2024Updated last year
- ☆45Dec 16, 2019Updated 6 years ago
- PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS☆24Jan 29, 2022Updated 4 years ago
- 论文复现,使用pos标记进行中 文多音字消歧☆21Jul 20, 2019Updated 6 years ago
- Implementation of the subscale framework from the WaveRNN paper, building on top of Fatchord's WaveRNN repo☆19Oct 8, 2020Updated 5 years ago
- ☆21Jun 16, 2021Updated 4 years ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference☆51Nov 1, 2019Updated 6 years ago
- This is an unofficial implementation of universal melgan according to https://arxiv.org/abs/2011.09631☆23Aug 15, 2022Updated 3 years ago
- Code to train and run Blow☆145Sep 4, 2019Updated 6 years ago
- An implement of GlowTTS model. Several modes are added: speaker embedding, prosody encoder(GST), and gradient reversal.☆54Sep 14, 2022Updated 3 years ago
- A collection of utilities for handling IPA phones.☆26Sep 24, 2023Updated 2 years ago
- Normalize Text in Russian☆28Nov 7, 2023Updated 2 years ago
- Source code of APNet2, a vocoder☆58Nov 23, 2023Updated 2 years ago
- Gaussian Mixture VAE Tacotron☆53Jul 6, 2023Updated 2 years ago
- GPT-style network for phonemization with durations of text☆68Mar 21, 2024Updated last year
- tacotron for research on Chinese speech synthesis and Taiwanese speech synthesis from Chinese input text sequence with different granular…☆25Aug 2, 2018Updated 7 years ago
- MelGAN implementation with Multi-Band and Full Band supports...☆62Aug 27, 2020Updated 5 years ago
- trying to reproduce suno v3☆35Jan 29, 2025Updated last year
- Attempt at speech2speech using CycleGAN☆28Jul 26, 2017Updated 8 years ago
- Multispeaker Community Vocoder Model for DiffSinger☆39Aug 11, 2025Updated 6 months ago