zerlinwang / synthetic-corpus-vocoderLinks

Official repository for the paper "A SYNTHETIC CORPUS GENERATION METHOD FOR NEURAL VOCODER TRAINING"

☆1

Alternatives and similar repositories for synthetic-corpus-vocoder

Users that are interested in synthetic-corpus-vocoder are comparing it to the libraries listed below

Sorting:

zengchang233 / CrossSinger
The source code for the paper CrossSinger (asru2023)
☆18Updated last year
p1an-lin-jung / wv_tts
☆19Updated last year
cyhuang-tw / robust-vc
☆11Updated 3 years ago
ryota-komatsu / speech_resynth
Speech Resynthesis and Language Modeling
☆20Updated last month
CODEJIN / XiaoiceSing2
☆19Updated 2 years ago
meaningTeam / tidy-tunes
Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …
☆21Updated last week
ex3ndr / supervoice-librilight-preprocessed
60k hours of phoneme-aligned audio from audio books
☆18Updated 11 months ago
mcf330 / efts2code
source code of EfficientTTS 2
☆14Updated last year
shengcanxu / canoSpeech
text to speech
☆10Updated last year
anton-kashkin / hifi_vc
☆25Updated 2 years ago
lexkoro / cfm-vc
☆11Updated 4 months ago
yangdongchao / ALMTokenizer2
The open source code of ALMTokenizer2: Towards Low bit-rate and Semantic-rich Audio Tokenizer with Flow-based Scalar Diffusion Transforme…
☆26Updated last month
exercise-book-yq / FreeCodec
FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS
☆21Updated 10 months ago
huutuongtu / Lightvoc
LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM
☆18Updated last year
speechnovateur / languagecodec_tmp
Temporary anonymous version
☆22Updated last year
ddlBoJack / Awesome-Speech-Generation
Paper, Code and Statistics for Speech Generatation.
☆10Updated 2 years ago
thuhcsi / SnakeGAN
Please visit https://thuhcsi.github.io/SnakeGAN/
☆37Updated 2 years ago
Infinity-INF / fast-phasr
Phonemes and durations labeling based on whisper small
☆11Updated last year
BUTSpeechFIT / TS_SUPERB
☆15Updated 3 months ago
mushanshanshan / ESLTTS
ESLTTS dataset
☆16Updated 5 months ago
JusperLee / Gull-Codec-Training
☆13Updated 4 months ago
ajaybati / miipher2.0
Reimplementation of Miipher
☆22Updated last year
NVIDIA / elucidated-text-to-audio
Elucidated Text-To-Audio (ETTA) is a SOTA text-to-audio model with a holistic understanding of the design space and trained with syntheti…
☆33Updated 2 weeks ago
karchkha / MelSpec_GPT_VQVAE
Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms
☆18Updated last year
audiodemo / voice-conversion
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17Updated last year
asuni / PitchSqueezer
A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation
☆34Updated last year
yoongi43 / VRVQ
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11Updated 3 months ago
thuhcsi / PortableTTS
☆12Updated 2 years ago
jisang93 / VISinger
Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…
☆15Updated 2 years ago
Mddct / transformer-vocos
☆28Updated last week