entn-at/DurIAN-1

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/entn-at/DurIAN-1)

entn-at / DurIAN-1

Implementation of "DurIAN: Duration Informed Attention Network For Multimodal Synthesis".

☆15

Alternatives and similar repositories for DurIAN-1

Users that are interested in DurIAN-1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ivanvovk / durian-pytorch
View on GitHub
Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.
☆184Aug 12, 2020Updated 5 years ago
Rongjiehuang / Multiband-WaveRNN
View on GitHub
An unofficial implement of autoregressive vocoder Multiband-WaveRNN. Audio samples in https://rongjiehuang.github.io/Multiband-WaveRNN/
☆28Feb 12, 2021Updated 5 years ago
Aria-K-Alethia / speaking-rate-controllable-hifi-gan
View on GitHub
☆16Apr 4, 2022Updated 4 years ago
LEEYOONHYUNG / GraphTTS
View on GitHub
☆12Jul 6, 2023Updated 3 years ago
lifeiteng / TTS-TextAnalyzer
View on GitHub
TTS Text Analyzer
☆31Jul 20, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
nwpuaslp / TTS_Course
View on GitHub
☆70Nov 30, 2020Updated 5 years ago
qiuyuda / InverseFaceRender
View on GitHub
tensorflow code for inverse face rendering
☆19Mar 27, 2020Updated 6 years ago
CODEJIN / Glow_TTS
View on GitHub
An implement of GlowTTS model. Several modes are added: speaker embedding, prosody encoder(GST), and gradient reversal.
☆55Sep 14, 2022Updated 3 years ago
gteu / realtime-ppg-vc
View on GitHub
Voice conversion model for real-time speech synthesis using PPG (Phonetic PosteriorGram) as an intermediate feature, written in Pytorch.
☆29Mar 3, 2022Updated 4 years ago
rishikksh20 / melgan
View on GitHub
MelGAN implementation with Multi-Band and Full Band supports...
☆63Aug 27, 2020Updated 5 years ago
rarefin / TTS_VAE
View on GitHub
Text to Speech Synthesis based on controllable latent representation
☆14Aug 30, 2019Updated 6 years ago
cpuimage / Tacotron-2
View on GitHub
Tensorflow implementation of DeepMind's Tacotron-2 (without wavenet)
☆11Jul 12, 2019Updated 7 years ago
candlewill / RawNet
View on GitHub
RawNet: Fast End-to-End Neural Vocoder
☆43May 29, 2019Updated 7 years ago
ex3ndr / supervoice-librilight-preprocessed
View on GitHub
60k hours of phoneme-aligned audio from audio books
☆19Jul 27, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
yanggeng1995 / Multi-band-WaveRNN
View on GitHub
☆45Dec 16, 2019Updated 6 years ago
liuhuang31 / g2pw_once
View on GitHub
G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…
☆14Dec 30, 2023Updated 2 years ago
li1jkdaw / LPCNet_parallel
View on GitHub
Simulation of parallel synthesis with LPCNet vocoder
☆14May 5, 2020Updated 6 years ago
ishine / PnG-BERT
View on GitHub
PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS
☆24Jan 29, 2022Updated 4 years ago
XierHacker / Model_Fusion_Based_Prosody_Prediction
View on GitHub
Model Fusion Based Prosody Prediction
☆17Mar 18, 2018Updated 8 years ago
bfs18 / tacotron2
View on GitHub
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
☆51Nov 1, 2019Updated 6 years ago
shakingWaves / LPCNet_torch
View on GitHub
torch version of LPCNet
☆22Jul 8, 2020Updated 6 years ago
rishikksh20 / gmvae_tacotron
View on GitHub
Gaussian Mixture VAE Tacotron
☆54Jul 6, 2023Updated 3 years ago
MlWoo / sentence2pinyin
View on GitHub
tts fronted-end
☆11Dec 19, 2018Updated 7 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
ajinkyakulkarni14 / ERISHA
View on GitHub
ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…
☆44Dec 17, 2020Updated 5 years ago
ishine / Mutiband-HIFIGAN
View on GitHub
Mutiband version of HIFIGAN
☆19Nov 6, 2020Updated 5 years ago
ex3ndr / supervoice-gpt
View on GitHub
GPT-style network for phonemization with durations of text
☆68Mar 21, 2024Updated 2 years ago
YichaoL / Chinese_Polyphone_Disambiguation
View on GitHub
论文复现，使用pos标记进行中文多音字消歧
☆21Jul 20, 2019Updated 7 years ago
carricky / Image_Blend
View on GitHub
OpenCV implementation of the poisson image blend and Mean-Value-Coordinate image clone method
☆10Nov 14, 2017Updated 8 years ago
HappyBall / tacotron
View on GitHub
tacotron for research on Chinese speech synthesis and Taiwanese speech synthesis from Chinese input text sequence with different granular…
☆25Aug 2, 2018Updated 7 years ago
dukGuo / valle-audiodec
View on GitHub
Inference code for Audiodec-Valle-Wenetspeech4TTS
☆51Jul 14, 2024Updated 2 years ago
hhguo / FastGriffinLim_Pytorch
View on GitHub
☆13Nov 16, 2020Updated 5 years ago
SwordWong / FunctionalRecovery
View on GitHub
☆10Apr 26, 2017Updated 9 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
thuhcsi / VAENAR-TTS
View on GitHub
The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.
☆144Jul 8, 2021Updated 5 years ago
Deepest-Project / AlignTTS
View on GitHub
Implementation of the AlignTTS
☆77Jul 6, 2023Updated 3 years ago
azraelkuan / voice-conversion
View on GitHub
an tutorial implement of voice conversion using pytorch
☆34Mar 30, 2018Updated 8 years ago
L0SG / WaveFlow
View on GitHub
A PyTorch implementation of "WaveFlow: A Compact Flow-based Model for Raw Audio" (ICML 2020)
☆127Jul 25, 2024Updated 2 years ago
rishikksh20 / Zero-Shot-TTS
View on GitHub
Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
☆34Sep 24, 2021Updated 4 years ago
MiniXC / phones
View on GitHub
A collection of utilities for handling IPA phones.
☆27Sep 24, 2023Updated 2 years ago
rhoposit / multilingual_VQVAE
View on GitHub
☆37May 8, 2021Updated 5 years ago