Mildemelwe / Non-English-Tacotron-2-Training-Notebook
Tacotron 2 training notebook supporting Japanese, French, and Mandarin
☆11Updated 2 years ago
Alternatives and similar repositories for Non-English-Tacotron-2-Training-Notebook:
Users that are interested in Non-English-Tacotron-2-Training-Notebook are comparing it to the libraries listed below
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆10Updated 3 months ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆30Updated last year
- List of repositories relevant to VITS.☆36Updated 2 years ago
- StyleTTS 2 Optimized Training Fork☆26Updated last month
- Real-time end-to-end singing voice convertion☆20Updated 4 months ago
- 'Grad-TTS' with Multilingual Cleaners☆10Updated 11 months ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆47Updated 2 years ago
- liujing04/Retrieval-based-Voice-Conversion-WebUI reconstruction project☆33Updated last year
- Misc. tools/scripts that I made to use for tortoise☆21Updated 7 months ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆22Updated 7 months ago
- ☆25Updated last year
- RVC Onnx Infer- Upgraded and simplified-ish☆21Updated 10 months ago
- A collection of all our phonemeizers for dataset construction and inference☆22Updated last month
- Render wav and convert it with [Diff-SVC](https://github.com/prophesier/diff-svc) model☆10Updated 2 years ago
- IPA Phonemizer/Dephonemizer for 136 human languages☆19Updated this week
- Heteronym to Phoneme Parser☆18Updated last year
- My vocoder experiments☆27Updated 5 months ago
- 44100Hz日本語音源に対応させた unofficial vits2-TTS implementation in pytorchです。☆23Updated last year
- End-to-End SpeechSynthesis system with fastspeech2 & hifigan☆13Updated 2 years ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…☆27Updated 2 years ago
- Streamlit app to visualize and edit TTS datasets☆14Updated 3 years ago
- ☆13Updated 7 months ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆19Updated 2 years ago
- ☆10Updated 4 months ago
- ☆11Updated 2 years ago
- Retrieval-based Voice Conversion (RVC) implemented with Hugging Face Transformers.☆67Updated last year
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆15Updated last year
- ☆21Updated 7 months ago
- Non Parallel Voice Conversion based on VITS☆24Updated last year
- 44100Hz日本語音源に対応した PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor です。☆19Updated last year