Rumeysakeskin / Turkish-Text-to-Speech
Speech synthesis (TTS) in low-resource languages by training from scratch with FastPitch and fine-tuning with HiFi-GAN
☆58 · Updated last year
Alternatives and similar repositories for Turkish-Text-to-Speech
Users who are interested in Turkish-Text-to-Speech are comparing it to the repositories listed below
- Turkish Speech Recognition using Facebook's Wav2vec 2.0 models ☆30 · Updated 3 years ago
- A multilingual text-to-speech synthesis system for ten lower-resourced Turkic languages: Azerbaijani, Bashkir, Kazakh, Kyrgyz, Sakha, Tat… ☆72 · Updated 2 years ago
- ☆169 · Updated 9 months ago
- Callytics is an advanced call analytics solution that leverages speech recognition and large language models (LLMs) technologies to analy… ☆71 · Updated 5 months ago
- Verifying the identity of a person from characteristics of the voice independent from language via NVIDIA NeMo models (ECAPA-TDNN, Speake… ☆36 · Updated 2 years ago
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS. ☆84 · Updated 10 months ago
- Finetune VITS and MMS using HuggingFace's tools ☆164 · Updated last year
- Building a Deep learning model that predicts the gender of a speaker using TensorFlow 2 ☆127 · Updated 2 years ago
- VoiceHub: A Unified Inference Interface for TTS Models ☆52 · Updated 3 weeks ago
- Aiming to achieve ultimate Multilingual TTS pipeline with main focus on releasing COQUI🐸TTS(Text-to-Speech) based high performing neural… ☆41 · Updated 2 years ago
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing. ☆252 · Updated last year
- finetune llm part for spark-tts model ☆109 · Updated 6 months ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition ☆15 · Updated last year
- SoTA open-source TTS ☆87 · Updated 3 months ago
- Voice activity detection and speaker gender segmentation audiovisual corpus ☆16 · Updated 8 months ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ☆137 · Updated 2 years ago
- ☆144 · Updated last month
- [WIP] VoiceSmith makes training text to speech models easy. ☆225 · Updated 2 years ago
- ☆49 · Updated 2 years ago
- Mirror of hf.co/pyannote/speaker-diarization-3.1 ☆26 · Updated last year
- ☆27 · Updated 2 years ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s… ☆28 · Updated 2 years ago
- A simple script to prepare dataset for training with TTS Tortoise model via https://git.ecker.tech/mrq/ai-voice-cloning ☆12 · Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ☆148 · Updated last year
- Create an LJSpeech structured voice dataset on wave input ☆34 · Updated 11 months ago
- A multilingual ASR model that can recognize ten Turkic languages: Azerbaijani, Bashkir, Chuvash, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Uy… ☆72 · Updated last month
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model. ☆40 · Updated 4 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆68 · Updated 2 weeks ago
- Tools to create your own voice dataset for TTS training ☆68 · Updated 4 years ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning … ☆26 · Updated 2 years ago