Mildemelwe / Non-English-Tacotron-2-Training-NotebookLinks
Tacotron 2 training notebook supporting Japanese, French, and Mandarin
☆11Updated 2 years ago
Alternatives and similar repositories for Non-English-Tacotron-2-Training-Notebook
Users that are interested in Non-English-Tacotron-2-Training-Notebook are comparing it to the libraries listed below
Sorting:
- Real-time end-to-end singing voice convertion☆22Updated 8 months ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆26Updated last year
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆12Updated 7 months ago
- Speech AI training and inference tools☆36Updated 2 years ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆19Updated 2 years ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…☆28Updated 2 years ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆30Updated last year
- List of repositories relevant to VITS.☆36Updated 2 years ago
- RTVC: Real-Time Voice Conversion GUI☆56Updated last year
- ☆25Updated 11 months ago
- A project about learning how to synchronize subtitles in movies using machine learning.☆9Updated 2 years ago
- RVC Onnx Infer- Upgraded and simplified-ish☆21Updated last year
- liujing04/Retrieval-based-Voice-Conversion-WebUI reconstruction project☆33Updated last year
- 44100Hz日本語音源に対応した MB-iSTFT-VITS: Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Tim…☆37Updated 2 years ago
- 'Grad-TTS' with Multilingual Cleaners☆10Updated last year
- ☆28Updated 2 months ago
- Ready-to-use Multilingual Text-To-Speech (TTS) package.☆22Updated last year
- Retrieval-based Voice Conversion (RVC) implemented with Hugging Face Transformers.☆68Updated last month
- ☆10Updated 8 months ago
- 44100Hz日本語音源に対応した PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor です。☆20Updated 2 years ago
- speaker-disentangled speech linguistic content quantizer☆21Updated 3 months ago
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker …☆20Updated 9 months ago
- A collection of all our phonemeizers for dataset construction and inference☆24Updated 4 months ago
- ☆28Updated last year
- Finally, some decent sample sentences☆23Updated last year
- Japanese Dataset to Multi Language TTS (Only for Japanese Dataset)☆3Updated last year
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models☆66Updated 2 years ago
- StyleTTS 2 Optimized Training Fork☆32Updated 5 months ago
- VITS2 using Phoneme-Level Japanese BERT☆13Updated last year
- RVC Inference with multiple model and huggingface support☆106Updated last year