Text to Speech Synthesis based on controllable latent representation
☆14Aug 30, 2019Updated 6 years ago
Alternatives and similar repositories for TTS_VAE
Users that are interested in TTS_VAE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Jul 6, 2023Updated 2 years ago
- Gaussian Mixture VAE Tacotron☆54Jul 6, 2023Updated 2 years ago
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…☆15May 30, 2021Updated 4 years ago
- ☆198May 3, 2024Updated last year
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge☆21Jul 25, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Implementation of "DurIAN: Duration Informed Attention Network For Multimodal Synthesis".☆14Jul 6, 2020Updated 5 years ago
- Phonemes and durations labeling based on whisper small☆11Jul 7, 2024Updated last year
- ☆51Feb 15, 2019Updated 7 years ago
- TTS Text Analyzer☆31Jul 20, 2023Updated 2 years ago
- This repository provides an implementation of the DPCCN model for single-channel speech separation. More details will be updated soon.☆13Dec 8, 2021Updated 4 years ago
- text to speech☆10Mar 19, 2024Updated 2 years ago
- Link to paper: https://www.isca-speech.org/archive_v0/SpeechProsody_2020/pdfs/51.pdf☆32Jul 6, 2023Updated 2 years ago
- chinese_tacotron-2☆12Feb 27, 2018Updated 8 years ago
- Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet☆62Jun 8, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.☆229Aug 17, 2020Updated 5 years ago
- unofficial pytorch implementation of HiFi-GAN with fast MISR.☆15Mar 21, 2023Updated 3 years ago
- torch version of LPCNet☆22Jul 8, 2020Updated 5 years ago
- ☆13Sep 21, 2022Updated 3 years ago
- ☆112Jun 11, 2021Updated 4 years ago
- Tensorflow implementation of DeepMind's Tacotron-2 (without wavenet)☆11Jul 12, 2019Updated 6 years ago
- MultiSpeaker Tacotron2 using LifeLong Learning.☆13Sep 27, 2019Updated 6 years ago
- MelGAN and Tacotron 2 in PyTorch☆11Oct 22, 2019Updated 6 years ago
- ☆39Sep 25, 2025Updated 6 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis☆115Dec 2, 2020Updated 5 years ago
- 60k hours of phoneme-aligned audio from audio books☆19Jul 27, 2024Updated last year
- A Tensorflow Implementation of the FastSpeech 2: Fast and High-Quality End-to-End Text to Speech☆11Aug 12, 2020Updated 5 years ago
- Simulation of parallel synthesis with LPCNet vocoder☆14May 5, 2020Updated 5 years ago
- Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.☆71Mar 19, 2021Updated 5 years ago
- Sing any popular song with your voice☆11Jul 10, 2022Updated 3 years ago
- A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project g…☆147Jun 6, 2022Updated 3 years ago
- ☆21Feb 27, 2024Updated 2 years ago
- Official PyTorch implementation of TTS Style Transfer☆25Jun 22, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆46Apr 16, 2023Updated 2 years ago
- ☆40Jul 15, 2025Updated 8 months ago
- Python Implementation of Visual Relative Attributes for Image Classification and Zero Shot Learning☆22Jun 14, 2018Updated 7 years ago
- ☆19May 11, 2024Updated last year
- ParallelWaveGAN adaptation for Mozilla TTS☆15May 23, 2020Updated 5 years ago
- Encoder and Decoder and Attention Based Prosody Prediction☆68Jan 17, 2018Updated 8 years ago
- ☆16Dec 23, 2021Updated 4 years ago