ShoukanLabs / Vokan

The Vokan Architecture (Tsukasa speech based)

☆10

Alternatives and similar repositories for Vokan

Users who are interested in Vokan are comparing it to the repositories listed below.

ShoukanLabs / VoPho
A collection of all our phonemizers for dataset construction and inference
☆27Updated 9 months ago
Respaired / RiFornet_Vocoder
A neural vocoder supporting Ring Attention, Conformer, and NSF.
☆23Updated 4 months ago
naver-ai / RapFlow-TTS
☆47Updated 4 months ago
EZ-VC / EZ-VC
[EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion
☆30Updated 3 months ago
duerig / StyleTTS2
StyleTTS 2 Optimized Training Fork
☆34Updated 10 months ago
choiHkk / Transformer-TTS-V2
☆25Updated last year
flamed-tts / Flamed-TTS
This repository implements a novel zero-shot TTS framework, Flamed-TTS, focusing on efficient generation and dynamic pacing in …
☆56Updated 4 months ago
shivammehta25 / BetterFastSpeech2
Just another FastSpeech 2 but cleaner code :)
☆28Updated last year
huutuongtu / Lightvoc
LightVoc: An Upsampling-Free GAN Vocoder Based on Conformer and Inverse Short-Time Fourier Transform
☆18Updated last year
iamanigeeit / present
☆14Updated last year
reppy4620 / vocoders
My vocoder experiments
☆31Updated 4 months ago
p1an-lin-jung / wv_tts
☆19Updated last year
ex3ndr / supervoice-hybrid
My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless, and TortoiseTTS into one
☆26Updated last year
MaxMax2016 / Glow-SVC
4 GB GPU and 10 minutes for training
☆12Updated 2 years ago
rishikksh20 / MiniMax-TTS-pytorch
An attempt to replicate the architecture of MiniMaxTTS described in its technical report
☆49Updated 3 months ago
Tikai7 / DiTTO-TTS
DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors
☆35Updated 10 months ago
OlaWod / PitchVC
PitchVC: Pitch Conditioned Any-to-Many Voice Conversion
☆36Updated last year
ArenAcikgoz / Whisper-Alignment
Forced alignment decoder for Whisper.
☆14Updated last year
reppy4620 / convnext_tts
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆17Updated last year
kamperh / linearvc
Voice conversion with just linear regression.
☆31Updated 2 months ago
adelacvg / diff-vits
☆39Updated 2 years ago
jisang93 / VISinger
Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…
☆19Updated 2 years ago
yxlu-0102 / IDEA-TTS
Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis
☆27Updated 8 months ago
ogunlao / glowtts_stdp
Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor
☆18Updated 2 years ago
hcy71o / SC-CNN
SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems
☆39Updated 2 years ago
mbzuai-nlp / sttatts
☆30Updated last year
archinetai / aligner-pytorch
Sequence alignment methods with helpers for PyTorch.
☆24Updated 3 years ago
yukara-ikemiya / Open-Miipher-2
PyTorch implementation of Miipher-2 [2025], a speech restoration model by Google DeepMind
☆60Updated 2 months ago
audiodemo / voice-conversion
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17Updated 2 years ago
yl4579 / SLMGAN
SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs
☆16Updated 2 years ago