reppy4620 / vocos

My implementation of Vocos for comparison.

☆12

Related projects

Alternatives and complementary repositories for vocos

tonnetonne814 / SiFi-VITS2-44100-Ja
DDPM-based Pitch Generation and Pitch Controllable Voice Synthesis.
☆51 Updated last year
lifeiteng / Aligner-SUPERB
Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark
☆22 Updated 4 months ago
yukara-ikemiya / wavefit-pytorch
PyTorch implementation of WaveFit [2022, Google], one of the SOTA lightweight/fast speech vocoders.
☆47 Updated last month
wetdog / wavenext_pytorch
Unofficial implementation of the WaveNeXt vocoder
☆32 Updated 2 months ago
Edresson / ZS-TTS-Evaluation
☆32 Updated 2 months ago
MaxMax2016 / Glow-SVC
4G GPU & 10 Minutes for train
☆12 Updated last year
PlayVoice / BigVGAN
BigVGAN with Neural Source-Filter
☆50 Updated last year
innnky / descript-audio-vae
VAE modified from Descript Audio Codec, which replaces the RVQ with VAE
☆54 Updated 7 months ago
line / promptttspp
PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions
☆60 Updated last month
reppy4620 / vocoders
My vocoder experiments
☆21 Updated last month
freds0 / CML-TTS-Dataset
CML-TTS: A Multilingual Dataset for Speech Synthesis
☆29 Updated 3 months ago
exercise-book-yq / Supercodec
☆42 Updated last month
p1an-lin-jung / wv_tts
☆19 Updated 8 months ago
shivammehta25 / BetterFastSpeech2
Just another FastSpeech 2 but cleaner code :)
☆25 Updated 4 months ago
choiHkk / Transformer-TTS-V2
☆26 Updated 8 months ago
shang0712 / HierTTS
☆44 Updated last year
liuhuadai / ViT-TTS
PyTorch Implementation of ViT-TTS (EMNLP'23)
☆10 Updated last year
y-chan / hifi-gan-misrnet
Unofficial PyTorch implementation of HiFi-GAN with fast MISR.
☆15 Updated last year
speechnovateur / languagecodec_tmp
Temporary anonymous version
☆22 Updated 8 months ago
maxrmorrison / promonet
Prosody and Pronunciation Modification Network
☆44 Updated 3 months ago
amphionspace / tts-evaluation
An evaluation set for large-scale trained TTS models (Coming in Sep 2024)
☆12 Updated 2 months ago
prml-lab-speech-team / demo
☆25 Updated 3 months ago
Mu-Y / DiariST
☆18 Updated last year
hcy71o / SC-CNN
SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems
☆39 Updated last year
redmist328 / APNet2
Source code of APNet2, a vocoder
☆51 Updated 11 months ago
ndkgit339 / spe-dss
Speech Parameter Estimation Using Differentiable Speech Synthesizer
☆44 Updated last year
Ereboas / TacoLM
☆16 Updated 6 months ago
AlexandaJerry / SingingVoice-MFA-Training
MFA acoustic model training based on Opencpop
☆12 Updated 2 years ago
asuni / PitchSqueezer
A robust pitch tracker using synchro-squeezed FFT and frequency-domain autocorrelation
☆34 Updated 10 months ago