Rumeysakeskin / Turkish-Text-to-Speech
Speech synthesis (TTS) in low-resource languages by training from scratch with Fastpitch and fine-tuning with HifiGan
☆43Updated 9 months ago
Related projects: ⓘ
- ☆31Updated this week
- Turkish Speech Recognition using Facebook's Wav2vec 2.0 models☆19Updated 2 years ago
- ☆16Updated this week
- ☆11Updated this week
- ☆17Updated this week
- Verifying the identity of a person from characteristics of the voice independent from language via NVIDIA NeMo models (ECAPA-TDNN, Speake…☆33Updated last year
- ☆24Updated last year
- A multilingual text-to-speech synthesis system for ten lower-resourced Turkic languages: Azerbaijani, Bashkir, Kazakh, Kyrgyz, Sakha, Tat…☆46Updated last year
- InceptionV3-Multi-layer GRU based automatic image captioning with Keras and TensorFlow frameworks☆19Updated last year
- Download speech datasets (English and non-English) for Automatic Speech Recognition☆13Updated last year
- A deep-learning powered accessibility application which turns pdfs into audio files. Featuring ocr improvement and tts with inflection!☆23Updated last year
- My guide to create an italian TTS with Coqui☆12Updated 2 years ago
- ☆17Updated last year
- spaCyTurk - trained models & pipelines for Turkish☆17Updated 2 years ago
- Uses machine learning to denoise audio containing speech☆28Updated 2 months ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆22Updated last year
- Finetune VITS and MMS using HuggingFace's tools☆112Updated 5 months ago
- ☆48Updated last year
- Repo for spaCy Turkish model development.☆57Updated last year
- Turkish Vision Language Model Development And Research☆13Updated last month
- RADTTS + HiFiGAN vocoder☆10Updated last year
- This repository contains the source code for the implementation of two deep learning models concerning the audio super resolution task.☆12Updated last year
- Context-Sensitive Neural Spelling Checker☆18Updated 8 months ago
- Automatic image captioning on Android-based mobile application with CNN and multi-layer GRU encoder-decoder model☆14Updated last year
- Create training data for training a voice cloner for bark text to speech.☆44Updated last year
- ☆13Updated last year
- Tools to create your own voice dataset for TTS training☆58Updated 3 years ago
- Finally, some decent sample sentences☆21Updated 9 months ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆21Updated last month
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆24Updated last year