Rumeysakeskin / Turkish-Text-to-Speech
Speech synthesis (TTS) in low-resource languages by training from scratch with Fastpitch and fine-tuning with HifiGan
☆66Updated 2 years ago
Alternatives and similar repositories for Turkish-Text-to-Speech
Users who are interested in Turkish-Text-to-Speech are comparing it to the libraries listed below.
- Turkish Speech Recognition using Facebook's Wav2vec 2.0 models☆31Updated 3 years ago
- Vietnamese Voice Cloning System using Speaker Verification training on multispeaker VITS☆57Updated 2 years ago
- A multilingual text-to-speech synthesis system for ten lower-resourced Turkic languages: Azerbaijani, Bashkir, Kazakh, Kyrgyz, Sakha, Tat…☆77Updated 2 years ago
- Verifying the identity of a person from characteristics of the voice independent from language via NVIDIA NeMo models (ECAPA-TDNN, Speake…☆41Updated 2 years ago
- ☆192Updated last year
- Callytics is an advanced call analytics solution that leverages speech recognition and large language models (LLMs) technologies to analy…☆74Updated 9 months ago
- SoTA open-source TTS☆134Updated 7 months ago
- Finetune VITS and MMS using HuggingFace's tools☆189Updated last year
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆52Updated 8 months ago
- An implementation for training the HiFi-GAN part of the XTTSv2 model using Coqui/TTS.☆86Updated last year
- A multilingual ASR model that can recognize ten Turkic languages—Azerbaijani, Bashkir, Chuvash, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Uy…☆79Updated 6 months ago
- VoiceHub: A Unified Inference Interface for TTS Models☆62Updated last month
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆14Updated 2 months ago
- Building a Deep learning model that predicts the gender of a speaker using TensorFlow 2☆129Updated 2 years ago
- Aiming to achieve ultimate Multilingual TTS pipeline with main focus on releasing COQUI🐸TTS(Text-to-Speech) based high performing neural…☆43Updated 2 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆37Updated 2 years ago
- Vi_G2P or ViG2P: G2P package for Vietnamese: based on vPhon and phonology knowledge to convert Raw text - Graphoneme to IPA☆102Updated last year
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆27Updated 2 years ago
- Live speech recognition using Facebook's wav2vec 2.0 model.☆376Updated last year
- Real-time Voice Activity Detection (VAD) with some example use cases, such as a simple voice bot and live (real-time) transcription☆104Updated 5 months ago
- ☆49Updated 2 years ago
- Python - NSW package for Vietnamese: Normalization system to convert numbers, abbreviations, and words that cannot be pronounced into syl…☆66Updated last year
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆258Updated last year
- A testing repo to share code and thoughts on diarisation☆57Updated last year
- ☆245Updated last month
- Create an LJSpeech-structured voice dataset from wave input☆37Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆154Updated last year
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆25Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆106Updated 7 months ago
- A python package for deep multilingual punctuation prediction.☆156Updated last year