ORI-Muchim / One-Click-MB-iSTFT-VITS2
MB-iSTFT-VITS2(Data Preprocessing + Whisper + Text Preprocessing + Making config.json + Training, Inference) ONE-CLICK
☆12Updated last year
Alternatives and similar repositories for One-Click-MB-iSTFT-VITS2:
Users that are interested in One-Click-MB-iSTFT-VITS2 are comparing it to the libraries listed below
- 'Grad-TTS' with Multilingual Cleaners☆10Updated last year
- Bilingual-TTS (Japanese and Korean)☆30Updated last year
- High quality text-to-speech based on StyleTTS 2.☆36Updated this week
- ☆13Updated 5 months ago
- ☆13Updated 8 months ago
- Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…☆15Updated last year
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆12Updated 2 months ago
- ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations☆18Updated this week
- Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model☆20Updated 2 weeks ago
- AudioSR-Upsampling (any -> 48kHz)☆40Updated last year
- ☆26Updated 2 months ago
- singing voice conversion based on glow-tts☆11Updated last year
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆17Updated 6 months ago
- Multi-speaker Speech Synthesis Using VITS(KO, JA, EN, ZH)☆74Updated last year
- Japanese Dataset to Multi Language TTS (Only for Japanese Dataset)☆3Updated last year
- ☆29Updated last year
- 4G GPU & 10 Minutes for train☆12Updated last year
- Official Demo Page for DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer☆32Updated 2 months ago
- Text-To-Speech for NotebookLM☆29Updated 4 months ago
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆27Updated 9 months ago
- Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis☆23Updated last month
- A collection of all our phonemeizers for dataset construction and inference☆22Updated 2 months ago
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆16Updated last year
- Chinese and English Bilinguish G2P☆20Updated last year
- C++ version of pyannote audio overlapped speech detection pipeline☆13Updated last year
- (WIP) A retrain of F5-TTS on permissively-licensed data☆11Updated 2 weeks ago
- ☆30Updated 2 years ago
- ☆30Updated 2 years ago
- Automatic speech annotator processing speech with voice activaty detection, overlapping speech detection, speaker diarization and automat…☆32Updated 10 months ago
- ☆35Updated last year