Yazdi9 / TTS-MultiLingualLinks
Text To Speech Multilingual Support (+20 Language)
☆45Updated 2 years ago
Alternatives and similar repositories for TTS-MultiLingual
Users that are interested in TTS-MultiLingual are comparing it to the libraries listed below
Sorting:
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆18Updated 2 weeks ago
- Official Code for ParrotTTS☆51Updated 7 months ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15Updated 3 weeks ago
- StyleTTS 2 Optimized Training Fork☆29Updated 4 months ago
- GPT-style network for phonemization with durations of text☆66Updated last year
- A Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTS☆40Updated 5 months ago
- High quality text-to-speech based on StyleTTS 2.☆47Updated last week
- ☆32Updated 2 months ago
- finetune llm part for spark-tts model☆76Updated 2 months ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated 2 years ago
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.☆77Updated 6 months ago
- ☆35Updated last year
- ☆57Updated 11 months ago
- Text-To-Speech for NotebookLM☆29Updated 5 months ago
- An unofficial PyTorch implementation of VALL-E☆87Updated this week
- 4G GPU & 10 Minutes for train☆12Updated last year
- (WIP) A retrain of F5-TTS on permissively-licensed data☆11Updated last month
- StyleTTS2 + Vocos as a Decoder☆12Updated 2 months ago
- ☆26Updated 7 months ago
- DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors☆24Updated 3 months ago
- TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching☆58Updated last month
- ☆25Updated last year
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆73Updated 8 months ago
- An open-source Kazakh Emotional Text-to-Speech Dataset☆29Updated last year
- Misc. tools/scripts that I made to use for tortoise☆21Updated 9 months ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆17Updated 7 months ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆51Updated 4 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Updated 3 years ago
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆27Updated 3 weeks ago
- Multispeaker Community Vocoder Model for DiffSinger☆37Updated last month