Mildemelwe / Non-English-Tacotron-2-Training-Notebook
Tacotron 2 training notebook supporting Japanese, French, and Mandarin
☆11Updated 2 years ago
Alternatives and similar repositories for Non-English-Tacotron-2-Training-Notebook
Users that are interested in Non-English-Tacotron-2-Training-Notebook are comparing it to the libraries listed below
Sorting:
- liujing04/Retrieval-based-Voice-Conversion-WebUI reconstruction project☆32Updated last year
- Open Source Text-to-Speech GUI Tool running on TalkNet☆11Updated 2 years ago
- Misc. tools/scripts that I made to use for tortoise☆21Updated 8 months ago
- RVC Onnx Infer- Upgraded and simplified-ish☆20Updated last year
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆12Updated 5 months ago
- StyleTTS 2 Optimized Training Fork☆28Updated 3 months ago
- speaker-disentangled speech linguistic content quantizer☆14Updated last month
- 44100Hz日本語音源に対応した PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor です。☆19Updated 2 years ago
- Real-time end-to-end singing voice convertion☆21Updated 6 months ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…☆28Updated 2 years ago
- Heteronym to Phoneme Parser☆18Updated last year
- ☆26Updated 2 weeks ago
- 'Grad-TTS' with Multilingual Cleaners☆10Updated last year
- Speech AI training and inference tools☆35Updated last year
- A collection of all our phonemeizers for dataset construction and inference☆22Updated 2 months ago
- 💠 Aivis: AI Voice Imitation System☆28Updated last year
- Finally, some decent sample sentences☆22Updated last year
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆16Updated last week
- List of repositories relevant to VITS.☆36Updated 2 years ago
- 44100Hz日本語音源に対応した MB-iSTFT-VITS: Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Tim…☆36Updated last year
- VITS2 using Phoneme-Level Japanese BERT☆13Updated last year
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆18Updated 3 months ago
- ☆25Updated last year
- Robust Speech Recognition via Large-Scale Weak Supervision☆30Updated last year
- The Vokan Architecture (Tsukasa speech based)☆9Updated 3 months ago
- Auto-Lyrics: Lyrics transcription & alignment using Whisper and yt-dlp☆19Updated 3 weeks ago
- My vocoder experiments☆29Updated 7 months ago
- Streamlit app to visualize and edit TTS datasets☆14Updated 3 years ago
- ☆39Updated last year
- DiffSinger training colab notebook to make training easier hopefully☆43Updated this week