as-ideas / ForwardTacotron

⏩ Generating speech in a single forward pass without any attention!

☆579

Alternatives and similar repositories for ForwardTacotron:

Users that are interested in ForwardTacotron are comparing it to the libraries listed below

NVIDIA / mellotron
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t…
☆855Updated last year
seungwonpark / melgan
MelGAN vocoder (compatible with NVIDIA/tacotron2)
☆641Updated 4 years ago
jaywalnut310 / glow-tts
A Generative Flow for Text-to-Speech via Monotonic Alignment Search
☆674Updated 2 years ago
NVIDIA / flowtron
Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style tr…
☆897Updated last year
as-ideas / TransformerTTS
🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.
☆1,137Updated 8 months ago
Kyubyong / css10
CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages
☆468Updated 4 years ago
SforAiDl / Neural-Voice-Cloning-With-Few-Samples
This repository has implementation for "Neural Voice Cloning With Few Samples"
☆432Updated 3 years ago
descriptinc / melgan-neurips
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
☆989Updated last year
xcmyz / FastSpeech
The Implementation of FastSpeech based on pytorch.
☆862Updated last year
nii-yamagishilab / multi-speaker-tacotron
VCTK multi-speaker tacotron for ICASSP 2020
☆265Updated 2 years ago
deterministic-algorithms-lab / Cross-Lingual-Voice-Cloning
Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.
☆358Updated last year
syang1993 / gst-tacotron
A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"
☆368Updated 6 years ago
auspicious3000 / autovc
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
☆1,025Updated 2 months ago
r9y9 / gantts
PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)
☆516Updated 4 years ago
Tomiinek / Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
☆833Updated last year
leimao / Voice-Converter-CycleGAN
Voice Converter Using CycleGAN and Non-Parallel Data
☆526Updated last year
soobinseo / Transformer-TTS
A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"
☆660Updated last year
Deepest-Project / MelNet
Implementation of "MelNet: A Generative Model for Audio in the Frequency Domain"
☆208Updated 5 months ago
r9y9 / tacotron_pytorch
PyTorch implementation of Tacotron speech synthesis model.
☆309Updated 5 years ago
kan-bayashi / ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
☆1,584Updated 8 months ago
facebookresearch / voxpopuli
A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation
☆520Updated last year
auspicious3000 / SpeechSplit
Unsupervised Speech Decomposition Via Triple Information Bottleneck
☆660Updated 2 months ago
coqui-ai / open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
☆1,298Updated 7 months ago
tugstugi / pytorch-dc-tts
Text to Speech with PyTorch (English and Mongolian)
☆185Updated 3 months ago
speechbrain / speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech proc…
☆366Updated last month
bshall / UniversalVocoding
A PyTorch implementation of "Robust Universal Neural Vocoding"
☆238Updated 4 years ago
liusongxiang / StarGAN-Voice-Conversion
This is a pytorch implementation of the paper: StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial ne…
☆517Updated 5 years ago
mozilla / DSAlign
DeepSpeech based forced alignment tool
☆235Updated 4 years ago
resemble-ai / MelNet
WIP: Open Source Implementation of "MelNet: A Generative Model for Audio in the Frequency Domain"
☆250Updated 5 years ago
KinglittleQ / GST-Tacotron
A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
☆366Updated 2 years ago