dscripka / synthetic_speech_dataset_generation
This repository contains text-to-speech (TTS) models and utilities designed produce synthetic training datasets for other speech-related models.
☆17Updated last year
Alternatives and similar repositories for synthetic_speech_dataset_generation:
Users that are interested in synthetic_speech_dataset_generation are comparing it to the libraries listed below
- Python bindings of speexdsp noise suppression library☆36Updated 2 years ago
- Goodness of Pronunciation algorithm using PyKaldi☆15Updated 2 years ago
- Mispronunciation detection code for jingju singing voice☆20Updated 6 years ago
- ☆12Updated last year
- A Kaldi/ESPnet based approach to perform automatic speech recognition on low resource languages☆9Updated 3 years ago
- A pipeline to isolate and transcribe one language in mixed-language speech☆18Updated 2 years ago
- wake word spotting with kaldi☆19Updated 4 years ago
- A handy dataset of noises for ASR☆19Updated 5 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated 11 months ago
- Codebase for "Transcription free filler word detection with Neural semi-CRFs" [ICASSP2023]☆8Updated 6 months ago
- ☆17Updated 3 years ago
- Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>☆18Updated 3 years ago
- Word Error Rate Estimation☆11Updated 4 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- ☆14Updated 2 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated last year
- ☆11Updated 3 years ago
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting☆23Updated 2 years ago
- Perform the forced decoding with target transcription☆11Updated 6 years ago
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆10Updated 4 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆14Updated 2 years ago
- ☆11Updated 3 years ago
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…☆24Updated 3 years ago
- Evaluation of STT models for german language☆15Updated 3 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆20Updated 10 months ago
- ☆41Updated 2 years ago
- Keyword Spotting (KWS) API wrapper for TFLite streaming models.☆12Updated 3 years ago
- C++ version of pyannote audio overlapped speech detection pipeline☆10Updated 11 months ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- Zero-shot Audio Classification using Whisper☆77Updated 2 years ago