rarefin/TTS_VAE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/rarefin/TTS_VAE)

rarefin / TTS_VAE

Text to Speech Synthesis based on controllable latent representation

☆14

Alternatives and similar repositories for TTS_VAE

Users that are interested in TTS_VAE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

LEEYOONHYUNG / GraphTTS
View on GitHub
☆12Jul 6, 2023Updated 3 years ago
rishikksh20 / gmvae_tacotron
View on GitHub
Gaussian Mixture VAE Tacotron
☆54Jul 6, 2023Updated 3 years ago
tachi-hi / tts_samples
View on GitHub
Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…
☆15May 30, 2021Updated 5 years ago
mutiann / neural-lexicon-reader
View on GitHub
Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge
☆21Jul 25, 2022Updated 3 years ago
asuni / wavelet_prosody_toolkit
View on GitHub
☆200May 3, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
entn-at / DurIAN-1
View on GitHub
Implementation of "DurIAN: Duration Informed Attention Network For Multimodal Synthesis".
☆15Jul 6, 2020Updated 6 years ago
rosinality / melgan-pytorch
View on GitHub
MelGAN and Tacotron 2 in PyTorch
☆11Oct 22, 2019Updated 6 years ago
Infinity-INF / fast-phasr
View on GitHub
Phonemes and durations labeling based on whisper small
☆11Jul 7, 2024Updated 2 years ago
yanggeng1995 / vae_tacotron
View on GitHub
☆51Feb 15, 2019Updated 7 years ago
shengcanxu / canoSpeech
View on GitHub
text to speech
☆10Mar 19, 2024Updated 2 years ago
lingjzhu / probing-TTS-models
View on GitHub
Link to paper: https://www.isca-speech.org/archive_v0/SpeechProsody_2020/pdfs/51.pdf
☆32Jul 6, 2023Updated 3 years ago
jyhan03 / dpccn
View on GitHub
This repository provides an implementation of the DPCCN model for single-channel speech separation. More details will be updated soon.
☆13Dec 8, 2021Updated 4 years ago
Yablon / auorange
View on GitHub
Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet
☆62Jun 8, 2021Updated 5 years ago
thuhcsi / Crystal
View on GitHub
Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.
☆230Aug 17, 2020Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
lifeiteng / TTS-TextAnalyzer
View on GitHub
TTS Text Analyzer
☆31Jul 20, 2023Updated 3 years ago
y-chan / hifi-gan-misrnet
View on GitHub
unofficial pytorch implementation of HiFi-GAN with fast MISR.
☆15Mar 21, 2023Updated 3 years ago
shakingWaves / LPCNet_torch
View on GitHub
torch version of LPCNet
☆22Jul 8, 2020Updated 6 years ago
BridgetteSong / Tacotron2
View on GitHub
☆13Sep 21, 2022Updated 3 years ago
CODEJIN / HiFiSinger
View on GitHub
☆111Jun 11, 2021Updated 5 years ago
awesome-archive / tacotron_cn
View on GitHub
chinese_tacotron-2
☆12Feb 27, 2018Updated 8 years ago
cpuimage / Tacotron-2
View on GitHub
Tensorflow implementation of DeepMind's Tacotron-2 (without wavenet)
☆11Jul 12, 2019Updated 7 years ago
xcmyz / Lifelong-Learning-Tacotron2
View on GitHub
MultiSpeaker Tacotron2 using LifeLong Learning.
☆13Sep 27, 2019Updated 6 years ago
ex3ndr / supervoice-librilight-preprocessed
View on GitHub
60k hours of phoneme-aligned audio from audio books
☆19Jul 27, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
bshall / Tacotron
View on GitHub
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
☆115Dec 2, 2020Updated 5 years ago
dathudeptrai / FastSpeech2
View on GitHub
A Tensorflow Implementation of the FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
☆11Aug 12, 2020Updated 5 years ago
li1jkdaw / LPCNet_parallel
View on GitHub
Simulation of parallel synthesis with LPCNet vocoder
☆14May 5, 2020Updated 6 years ago
shang0712 / HierTTS
View on GitHub
☆47Apr 16, 2023Updated 3 years ago
roedoejet / FastSpeech2_ACL2022_reproducibility
View on GitHub
☆21Feb 27, 2024Updated 2 years ago
dipjyoti92 / TTS-Style-Transfer
View on GitHub
Official PyTorch implementation of TTS Style Transfer
☆25Jun 22, 2022Updated 4 years ago
zzw922cn / LPC_for_TTS
View on GitHub
Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.
☆72Mar 19, 2021Updated 5 years ago
v-nhandt21 / MusicVoiceConversion
View on GitHub
Sing any popular song with your voice
☆11Jul 10, 2022Updated 4 years ago
fmarcinek / LICNN
View on GitHub
Lateral Inhibition-Inspired Convolutional Neural Network for Visual Attention and Saliency Detection
☆13Nov 6, 2020Updated 5 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
chaitanya100100 / Relative-Attributes-Zero-Shot-Learning
View on GitHub
Python Implementation of Visual Relative Attributes for Image Classification and Zero Shot Learning
☆22Jun 14, 2018Updated 8 years ago
keonlee9420 / Comprehensive-E2E-TTS
View on GitHub
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project g…
☆147Jun 6, 2022Updated 4 years ago
erogol / ParallelWaveGAN
View on GitHub
ParallelWaveGAN adaptation for Mozilla TTS
☆15May 23, 2020Updated 6 years ago
BoragoCode / AttentionBasedProsodyPrediction
View on GitHub
Encoder and Decoder and Attention Based Prosody Prediction
☆68Jan 17, 2018Updated 8 years ago
ernoult / targetProp
View on GitHub
Testing Difference Target Propagation (DTP) on MNIST.
☆13Oct 12, 2020Updated 5 years ago
ishine / Mutiband-HIFIGAN
View on GitHub
Mutiband version of HIFIGAN
☆19Nov 6, 2020Updated 5 years ago
audiodemo / voice-conversion
View on GitHub
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17Aug 18, 2023Updated 2 years ago