AppleHolic / FastSpeech2Links

Refactored version of https://github.com/ming024/FastSpeech2

☆14

Alternatives and similar repositories for FastSpeech2

Users that are interested in FastSpeech2 are comparing it to the libraries listed below

Sorting:

revsic / torch-whisper-guided-vc
Torch implementation of Whisper-guided DDPM based Voice Conversion
☆49Updated 2 years ago
nc-ai / speech
☆17Updated last month
CODEJIN / MLPSinger
☆24Updated 3 years ago
Sreyan88 / LipGER
Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition
☆17Updated last year
minkjung / blankcollapse
☆10Updated 2 years ago
KrishnaDN / BERTphone
Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"
☆17Updated 4 years ago
CODEJIN / Speaker_Embedding_Torch
PyTorch based speaker embedding model
☆16Updated last year
zldzmfoq12 / VCtube
A pakage for crawling audio from Youtube
☆42Updated last year
dipjyoti92 / TTS-Style-Transfer
Official PyTorch implementation of TTS Style Transfer
☆24Updated 3 years ago
sooftware / lightning-asr
Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.
☆46Updated 4 years ago
Aria-K-Alethia / speaking-rate-controllable-hifi-gan
☆16Updated 3 years ago
monglechap / fluenttts
FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS
☆20Updated 2 years ago
midas-research / speechmix
☆12Updated 4 years ago
cschaefer26 / StyleMelGAN
☆10Updated last year
keonlee9420 / Deep-Learning-TTS-Template
This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).
☆15Updated 4 years ago
karchkha / MelSpec_GPT_VQVAE
Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms
☆18Updated last year
rishikksh20 / Zero-Shot-TTS
Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
☆34Updated 3 years ago
speechnovateur / languagecodec_tmp
Temporary anonymous version
☆22Updated last year
xcmyz / Tacotron2-Pytorch
follow NVIDIA, simplify it and support data parallel.
☆13Updated 5 years ago
CODEJIN / DiffSingerKR
☆25Updated 10 months ago
Jackson-Kang / MFARunner
A simple tool to easily use Montreal Forced Aligner. Also provide alignment(TextGrid) retrieved from ESD.
☆45Updated 2 years ago
rishikksh20 / NU-Wave2-pytorch
NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]
☆24Updated 3 years ago
ldong1111 / GraphemeBERT
This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models
☆46Updated 3 years ago
hyama5 / vae_align
Alignment examples for Interspeech 2024
☆22Updated last year
cyfer0618 / kaldi-pytorch-rnnlm
Enable RNNLM lattice rescoring with Pytorch [kaldi]
☆12Updated 5 years ago
rhoposit / icassp2021
☆15Updated 4 years ago
tts-tutorial / icassp2022
☆64Updated 3 years ago
miccio-dk / NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Updated 3 years ago
revsic / torch-retriever-vc
PyTorch implementation of Retriever: Learning Content-Style Representation
☆12Updated 2 years ago
suzuki256 / dog-dataset
☆43Updated 3 years ago