dscripka / synthetic_speech_dataset_generation
This repository contains text-to-speech (TTS) models and utilities designed to produce synthetic training datasets for other speech-related models.
☆27 Updated 2 years ago
Alternatives and similar repositories for synthetic_speech_dataset_generation
Users interested in synthetic_speech_dataset_generation are comparing it to the repositories listed below.
- Indic-Conformer models for ASR ☆20 Updated last year
- A simple, but performant framework for mapping speech directly to categories and intents. ☆25 Updated last year
- An open-source, easily accessible package for training and deploying Speech-to-Intent models on microcontrollers and SBCs ☆48 Updated last year
- Tensorflow-based wake word detection ☆17 Updated 3 months ago
- Python3 code for the IEEE SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap" ☆12 Updated 5 years ago
- A pipeline to isolate and transcribe one language in mixed-language speech ☆19 Updated 3 years ago
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting ☆23 Updated 3 years ago
- ☆17 Updated 4 years ago
- Zero-shot Audio Classification using Whisper ☆79 Updated 3 years ago
- Speech Emotion Recognition ☆43 Updated 2 years ago
- Python bindings of speexdsp noise suppression library ☆46 Updated 3 years ago
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni… ☆23 Updated 4 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM ☆37 Updated 2 years ago
- C++ version of pyannote audio overlapped speech detection pipeline ☆13 Updated last year
- ☆49 Updated 3 years ago
- Mike/Projects/pysilero-vad.git ☆24 Updated 2 weeks ago
- ☆49 Updated 2 years ago
- ☆45 Updated 3 years ago
- Goodness of Pronunciation algorithm using PyKaldi ☆18 Updated 3 years ago
- ☆25 Updated 3 years ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion ☆13 Updated 10 months ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice. ☆215 Updated last year
- Add n-gram and large language model (LLM) support to Whisper models. ☆40 Updated 8 months ago
- ☆11 Updated 4 years ago
- All-in-one Speech Transcription ☆10 Updated this week
- A mini, simple, and fast end-to-end automatic speech recognition toolkit. ☆52 Updated 3 years ago
- SpeechYOLO Interspeech 2019 ☆46 Updated 3 years ago
- Code for "Error-driven Fixed-Budget ASR Personalization for Accented Speakers" in ICASSP 2021 ☆11 Updated 4 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database ☆35 Updated 2 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech. ☆13 Updated 2 years ago