Rumeysakeskin / Turkish-Text-to-SpeechLinks
Speech synthesis (TTS) in low-resource languages by training from scratch with Fastpitch and fine-tuning with HifiGan
☆64Updated 2 years ago
Alternatives and similar repositories for Turkish-Text-to-Speech
Users that are interested in Turkish-Text-to-Speech are comparing it to the libraries listed below
Sorting:
- Turkish Speech Recognition using Facebook's Wav2vec 2.0 models☆31Updated 3 years ago
- A multilingual text-to-speech synthesis system for ten lower-resourced Turkic languages: Azerbaijani, Bashkir, Kazakh, Kyrgyz, Sakha, Tat…☆76Updated 2 years ago
- Callytics is an advanced call analytics solution that leverages speech recognition and large language models (LLMs) technologies to analy…☆74Updated 9 months ago
- ☆185Updated last year
- Finetune VITS and MMS using HuggingFace's tools☆188Updated last year
- Aiming to achieve ultimate Multilingual TTS pipeline with main focus on releasing COQUI🐸TTS(Text-to-Speech) based high performing neural…☆43Updated 2 years ago
- A multilingual ASR model that can recognize ten Turkic languages—Azerbaijani, Bashkir, Chuvash, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Uy…☆77Updated 5 months ago
- Building a Deep learning model that predicts the gender of a speaker using TensorFlow 2☆130Updated 2 years ago
- Verifying the identity of a person from characteristics of the voice independent from language via NVIDIA NeMo models (ECAPA-TDNN, Speake…☆40Updated 2 years ago
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆52Updated 7 months ago
- ☆243Updated 3 weeks ago
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.☆86Updated last year
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆257Updated last year
- SoTA open-source TTS☆128Updated 7 months ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…☆28Updated 2 years ago
- VoiceHub: A Unified Inference Interface for TTS Models☆62Updated 3 weeks ago
- [WIP] VoiceSmith makes training text to speech models easy.☆228Updated 3 years ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆14Updated last month
- My guide to create an italian TTS with Coqui☆14Updated 3 years ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆25Updated last year
- finetune llm part for spark-tts model☆119Updated 9 months ago
- Onnx compatible styletts2 code☆16Updated 7 months ago
- Vietnamese Voice Cloning System using Speaker Verification training on multispeaker VITS☆57Updated 2 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆37Updated 2 years ago
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)☆103Updated 4 months ago
- Uses machine learning to denoise audio containing speech☆48Updated last year
- A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec.☆138Updated 3 months ago
- The official implementation of "A Language Modeling Approach to Diacritic-Free Hebrew TTS"☆103Updated 6 months ago
- 🎙️ Arabic TTS models (Tacotron2, FastPitch)☆135Updated 3 weeks ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆153Updated last year