dscripka / synthetic_speech_dataset_generationLinks
This repository contains text-to-speech (TTS) models and utilities designed produce synthetic training datasets for other speech-related models.
☆24Updated 2 years ago
Alternatives and similar repositories for synthetic_speech_dataset_generation
Users that are interested in synthetic_speech_dataset_generation are comparing it to the libraries listed below
Sorting:
- Indic-Conformer models for ASR☆18Updated last year
 - ☆17Updated 4 years ago
 - A pipeline to isolate and transcribe one language in mixed-language speech☆19Updated 3 years ago
 - An open-source, easily accessible package for training and deploying Speech-to-Intent models on microcontrollers and SBCs☆47Updated last year
 - This repository includes training, inference, evaluation, and utility scripts developed for fine-tuning the Whisper medium.en model on Ai…☆18Updated last year
 - Goodness of Pronunciation algorithm using PyKaldi☆16Updated 3 years ago
 - ☆44Updated 2 years ago
 - A merged version of multiple open-source German speech datasets.☆33Updated last year
 - A simple, but performant framework for mapping speech directly to categories and intents.☆22Updated last year
 - Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 5 years ago
 - Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>☆19Updated 3 years ago
 - Python3 code for the IEEE SPL paper "Auto-Tuning Spectral Clustering for SpeakerDiarization Using Normalized Maximum Eigengap"☆12Updated 5 years ago
 - Code for "Error-driven Fixed-Budget ASR Personalization for Accented Speakers" in ICASSP 2021☆11Updated 4 years ago
 - ☆25Updated 3 years ago
 - ☆16Updated 2 years ago
 - SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated 2 years ago
 - This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆29Updated last week
 - Python bindings of speexdsp noise suppression library☆41Updated 2 years ago
 - NPTEL2020: Speech2Text dataset for Indian-English Accent☆77Updated 3 years ago
 - C++ version of pyannote audio overlapped speech detection pipeline☆13Updated last year
 - Tensorflow-based wake word detection☆16Updated 2 weeks ago
 - Using YouTube to prepare a speech recognition dataset for any language☆10Updated 4 years ago
 - ☆13Updated last year
 - Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…☆25Updated 6 years ago
 - A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆53Updated 2 years ago
 - ☆48Updated 2 years ago
 - The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at I…☆18Updated 2 years ago
 - ☆19Updated 7 months ago
 - A composition of offline tools to achieve high quality multilingual speech to text transcription☆22Updated 2 months ago
 - Zero-shot Audio Classification using Whisper☆78Updated 2 years ago