ORI-Muchim / One-Click-MB-iSTFT-VITS2

MB-iSTFT-VITS2(Data Preprocessing + Whisper + Text Preprocessing + Making config.json + Training, Inference) ONE-CLICK

☆13

Alternatives and similar repositories for One-Click-MB-iSTFT-VITS2

Users who are interested in One-Click-MB-iSTFT-VITS2 are comparing it to the repositories listed below.

kdrkdrkdr / JK-VITS
Bilingual-TTS (Japanese and Korean)
☆30 Updated 2 years ago
ORI-Muchim / Grad-TTS
'Grad-TTS' with Multilingual Cleaners
☆10 Updated last year
AkshathRaghav / tinyspeech
Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"
☆19 Updated last month
reppy4620 / x-vits
☆13 Updated 8 months ago
Keith-Hon / vits-cantonese
Cantonese Text to Speech with VITS implementation
☆31 Updated 2 years ago
p0p4k / vits3_pytorch
☆29 Updated last year
duerig / StyleTTS2
StyleTTS 2 Optimized Training Fork
☆32 Updated 5 months ago
reppy4620 / convnext_tts
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆17 Updated 8 months ago
Stylish-TTS / stylish-tts
High quality text-to-speech based on StyleTTS 2.
☆52 Updated this week
ORI-Muchim / PolyLangVITS
Multi-speaker Speech Synthesis Using VITS(KO, JA, EN, ZH)
☆76 Updated last year
iamanigeeit / present
☆13 Updated 10 months ago
Zhongxu-Wang / ArtSpeech
ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations
☆18 Updated 2 months ago
idiap / knn-tts
Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model
☆27 Updated 2 months ago
frankyoujian / Edge-Punct-Casing
☆28 Updated 5 months ago
parrot-tts / Parrot-TTS
Official Code for ParrotTTS
☆52 Updated 9 months ago
5Hyeons / StyleTTS2-Vocos
StyleTTS2 + Vocos as a Decoder
☆13 Updated 3 months ago
ORI-Muchim / AudioSR-Upsampling
AudioSR-Upsampling (any -> 48kHz)
☆41 Updated last year
jisang93 / VISinger
Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…
☆15 Updated 2 years ago
MaxMax2016 / Glow-SVC
4G GPU & 10 Minutes for train
☆12 Updated last year
pengzhendong / Torchaudio-Forced-Aligner
Torchaudio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.
☆11 Updated 6 months ago
wetdog / wavenext_pytorch
Unofficial implementation of wavenext vocoder
☆48 Updated 10 months ago
ShoukanLabs / VoPho
A collection of all our phonemeizers for dataset construction and inference
☆24 Updated 4 months ago
D-Keqi / LS-Transducer-SST
☆11 Updated last year
choiHkk / Transformer-TTS-V2
☆25 Updated last year
ArenAcikgoz / Whisper-Alignment
Forced alignment decoder for Whisper.
☆14 Updated last year
MaxMax2016 / StreamingHiFiGAN
An Open-source Streaming High-fidelity Neural Audio Codec
☆11 Updated last year
PlayVoice / VI-SVC
VI-SVC model is just VITS without MAS and DurationPredictor.
☆10 Updated last year
naver / multilingual-distilwhisper
This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.
☆27 Updated last year
lifeiteng / Aligner-SUPERB
Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark
☆28 Updated 2 months ago
IDEA-Emdoor-Lab / UniTTS
A TTS Trained on Universal Audio.
☆37 Updated last month