Yazdi9 / TTS-MultiLingual
Text To Speech Multilingual Support (+20 Language)
☆44Updated 2 years ago
Alternatives and similar repositories for TTS-MultiLingual
Users that are interested in TTS-MultiLingual are comparing it to the libraries listed below
Sorting:
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆15Updated last week
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆52Updated this week
- High quality text-to-speech based on StyleTTS 2.☆39Updated this week
- GPT-style network for phonemization with durations of text☆64Updated last year
- Misc. tools/scripts that I made to use for tortoise☆21Updated 8 months ago
- A Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTS☆40Updated 5 months ago
- Official Code for ParrotTTS☆50Updated 7 months ago
- ☆20Updated 2 years ago
- StyleTTS 2 Optimized Training Fork☆28Updated 3 months ago
- An official implementation of Style-Talker for Spoken Dialogue Generation☆17Updated 4 months ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated last year
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆14Updated 2 months ago
- finetune llm part for spark-tts model☆69Updated last month
- Zero-Shot Foreign Accent Conversion without a Native Reference☆33Updated last year
- An open-source Kazakh Emotional Text-to-Speech Dataset☆28Updated last year
- Text-To-Speech for NotebookLM☆29Updated 4 months ago
- 4G GPU & 10 Minutes for train☆12Updated last year
- (WIP) A retrain of F5-TTS on permissively-licensed data☆11Updated last month
- ☆31Updated last month
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.☆48Updated last year
- 'Grad-TTS' with Multilingual Cleaners☆10Updated last year
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆51Updated 4 years ago
- Cantonese Text to Speech with VITS implementation☆29Updated 2 years ago
- VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems.☆39Updated last year
- Multispeaker Community Vocoder Model for DiffSinger☆37Updated last week
- ☆40Updated 3 months ago
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆71Updated 7 months ago
- Forced alignment decoder for Whisper.☆14Updated last year
- GPT for FACodec☆13Updated last year
- ☆35Updated last year