dscripka / synthetic_speech_dataset_generationLinks
This repository contains text-to-speech (TTS) models and utilities designed produce synthetic training datasets for other speech-related models.
☆21Updated 2 years ago
Alternatives and similar repositories for synthetic_speech_dataset_generation
Users that are interested in synthetic_speech_dataset_generation are comparing it to the libraries listed below
Sorting:
- A simple, but performant framework for mapping speech directly to categories and intents.☆21Updated last year
- An open-source, easily accessible package for training and deploying Speech-to-Intent models on microcontrollers and SBCs☆45Updated last year
- Goodness of Pronunciation algorithm using PyKaldi☆16Updated 3 years ago
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras☆11Updated 4 years ago
- ☆17Updated 4 years ago
- Python bindings of speexdsp noise suppression library☆40Updated 2 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- Goodness of Pronunciation using Kaldi on Epa-DB database☆35Updated last year
- ☆13Updated last year
- Python3 code for the IEEE SPL paper "Auto-Tuning Spectral Clustering for SpeakerDiarization Using Normalized Maximum Eigengap"☆12Updated 5 years ago
- Indic-Conformer models for ASR☆18Updated last year
- The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at I…☆17Updated 2 years ago
- A pipeline to isolate and transcribe one language in mixed-language speech☆19Updated 2 years ago
- Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".☆28Updated 3 years ago
- ☆47Updated 2 years ago
- Tensorflow-based wake word detection☆14Updated 10 months ago
- Hosts text-to-speech corpus and speech synthesizers for African languages.☆17Updated 2 years ago
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…☆12Updated 2 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆28Updated last year
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated 2 years ago
- ☆25Updated 3 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆38Updated last year
- A handy dataset of noises for ASR☆22Updated 6 years ago
- Speech to text library for Rhasspy using Kaldi☆14Updated last year
- Evaluation of STT models for german language☆15Updated 3 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆24Updated 4 years ago
- ☆11Updated 3 years ago
- Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>☆19Updated 3 years ago