Rumeysakeskin / Turkish-Text-to-Speech
Speech synthesis (TTS) in low-resource languages by training from scratch with Fastpitch and fine-tuning with HifiGan
☆57Updated last year
Alternatives and similar repositories for Turkish-Text-to-Speech
Users that are interested in Turkish-Text-to-Speech are comparing it to the libraries listed below
- Turkish Speech Recognition using Facebook's Wav2vec 2.0 models☆30Updated 3 years ago
- ☆167Updated 8 months ago
- A multilingual text-to-speech synthesis system for ten lower-resourced Turkic languages: Azerbaijani, Bashkir, Kazakh, Kyrgyz, Sakha, Tat…☆70Updated 2 years ago
- Verifying the identity of a person from characteristics of the voice independent from language via NVIDIA NeMo models (ECAPA-TDNN, Speake…☆36Updated 2 years ago
- VoiceHub: A Unified Inference Interface for TTS Models☆50Updated 3 weeks ago
- Callytics is an advanced call analytics solution that leverages speech recognition and large language models (LLMs) technologies to analy…☆71Updated 4 months ago
- An implementation for training the HiFi-GAN part of the XTTSv2 model using Coqui/TTS.☆85Updated 9 months ago
- Finetune VITS and MMS using HuggingFace's tools☆162Updated last year
- Aiming to achieve ultimate Multilingual TTS pipeline with main focus on releasing COQUI🐸TTS(Text-to-Speech) based high performing neural…☆42Updated 2 years ago
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆250Updated last year
- Building a Deep learning model that predicts the gender of a speaker using TensorFlow 2☆127Updated 2 years ago
- Vietnamese Voice Cloning System using Speaker Verification training on multispeaker VITS☆56Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆150Updated last year
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…☆28Updated 2 years ago
- Fine-tune the LLM part of the Spark-TTS model☆106Updated 5 months ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆15Updated last year
- Create an LJSpeech structured voice dataset on wave input☆33Updated 11 months ago
- [WIP] VoiceSmith makes training text to speech models easy.☆225Updated 2 years ago
- Speaker diarization model☆28Updated 2 years ago
- ☆27Updated 2 years ago
- SoTA open-source TTS☆81Updated 2 months ago
- This project is about performing Speaker diarization for Hindi Language.☆50Updated 4 years ago
- Google's SoundStorm: Efficient Parallel Audio Generation☆132Updated 2 years ago
- Real-time noise cancellation for audio signals, such as removing construction noise by denoising the signal.☆124Updated 3 years ago
- A multilingual ASR model that can recognize ten Turkic languages—Azerbaijani, Bashkir, Chuvash, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Uy…☆72Updated last month
- VALL-E 2 reproduction☆129Updated last year
- Your one-stop solution for voice dataset creation☆123Updated last year
- The human speaks a language with an accent. A particular accent necessarily reflects a person's linguistic background. The model defines …☆62Updated 3 years ago
- TTS models for Arabic (Tacotron2, FastPitch)☆117Updated 9 months ago
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆37Updated 3 months ago