ORI-Muchim / One-Click-MB-iSTFT-VITS2Links
MB-iSTFT-VITS2(Data Preprocessing + Whisper + Text Preprocessing + Making config.json + Training, Inference) ONE-CLICK
☆13Updated last year
Alternatives and similar repositories for One-Click-MB-iSTFT-VITS2
Users that are interested in One-Click-MB-iSTFT-VITS2 are comparing it to the libraries listed below
Sorting:
- Bilingual-TTS (Japanese and Korean)☆30Updated 2 years ago
- 'Grad-TTS' with Multilingual Cleaners☆10Updated last year
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆19Updated last month
- ☆13Updated 8 months ago
- Cantonese Text to Speech with VITS implementation☆31Updated 2 years ago
- ☆29Updated last year
- StyleTTS 2 Optimized Training Fork☆32Updated 5 months ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆17Updated 8 months ago
- High quality text-to-speech based on StyleTTS 2.☆52Updated this week
- Multi-speaker Speech Synthesis Using VITS(KO, JA, EN, ZH)☆76Updated last year
- ☆13Updated 10 months ago
- ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations☆18Updated 2 months ago
- Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model☆27Updated 2 months ago
- ☆28Updated 5 months ago
- Official Code for ParrotTTS☆52Updated 9 months ago
- StyleTTS2 + Vocos as a Decoder☆13Updated 3 months ago
- AudioSR-Upsampling (any -> 48kHz)☆41Updated last year
- Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…☆15Updated 2 years ago
- 4G GPU & 10 Minutes for train☆12Updated last year
- Torchaudio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.☆11Updated 6 months ago
- Unofficial implementation of wavenext vocoder☆48Updated 10 months ago
- A collection of all our phonemeizers for dataset construction and inference☆24Updated 4 months ago
- ☆11Updated last year
- ☆25Updated last year
- Forced alignment decoder for Whisper.☆14Updated last year
- An Open-source Streaming High-fidelity Neural Audio Codec☆11Updated last year
- VI-SVC model is just VITS without MAS and DurationPredictor.☆10Updated last year
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆27Updated last year
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆28Updated 2 months ago
- A TTS Trained on Universal Audio.☆37Updated last month