Rumeysakeskin / Turkish-Text-to-SpeechLinks
Speech synthesis (TTS) in low-resource languages by training from scratch with Fastpitch and fine-tuning with HifiGan
☆61Updated last year
Alternatives and similar repositories for Turkish-Text-to-Speech
Users that are interested in Turkish-Text-to-Speech are comparing it to the libraries listed below
Sorting:
- Turkish Speech Recognition using Facebook's Wav2vec 2.0 models☆31Updated 3 years ago
- A multilingual text-to-speech synthesis system for ten lower-resourced Turkic languages: Azerbaijani, Bashkir, Kazakh, Kyrgyz, Sakha, Tat…☆73Updated 2 years ago
- ☆172Updated 11 months ago
- Callytics is an advanced call analytics solution that leverages speech recognition and large language models (LLMs) technologies to analy…☆73Updated 7 months ago
- Building a Deep learning model that predicts the gender of a speaker using TensorFlow 2☆127Updated 2 years ago
- Aiming to achieve ultimate Multilingual TTS pipeline with main focus on releasing COQUI🐸TTS(Text-to-Speech) based high performing neural…☆42Updated 2 years ago
- Finetune VITS and MMS using HuggingFace's tools☆172Updated last year
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆254Updated last year
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.☆85Updated 11 months ago
- SoTA open-source TTS☆104Updated 5 months ago
- [WIP] VoiceSmith makes training text to speech models easy.☆226Updated 3 years ago
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆46Updated 5 months ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆151Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆104Updated 4 months ago
- Fine Tune the Style-TTS2 Voice Model☆256Updated 4 months ago
- Vietnamese Voice Cloning System using Speaker Verification training on multispeaker VITS☆57Updated last year
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆38Updated 2 years ago
- Your one-stop solution for voice dataset creation☆127Updated last year
- ☆36Updated 2 years ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆68Updated 2 weeks ago
- ☆190Updated 3 weeks ago
- VoiceHub: A Unified Inference Interface for TTS Models☆56Updated last month
- Create an LJSpeech structured voice dataset on wave input☆36Updated last year
- A testing repo to share code and thoughts on diarisation☆56Updated last year
- Mirror of hf.co/pyannote/speaker-diarization-3.1☆27Updated last year
- finetune llm part for spark-tts model☆111Updated 7 months ago
- Verifying the identity of a person from characteristics of the voice independent from language via NVIDIA NeMo models (ECAPA-TDNN, Speake…☆38Updated 2 years ago
- Add n-gram and large language model (LLM) support to Whisper models.☆33Updated 6 months ago
- My guide to create an italian TTS with Coqui☆14Updated 3 years ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆126Updated 3 months ago