dscripka / synthetic_speech_dataset_generation
This repository contains text-to-speech (TTS) models and utilities designed to produce synthetic training datasets for other speech-related models.
☆26Updated 2 years ago
Alternatives and similar repositories for synthetic_speech_dataset_generation
Users who are interested in synthetic_speech_dataset_generation are comparing it to the libraries listed below.
- A simple, but performant framework for mapping speech directly to categories and intents.☆23Updated last year
- Indic-Conformer models for ASR☆20Updated last year
- ☆17Updated 4 years ago
- An open-source, easily accessible package for training and deploying Speech-to-Intent models on microcontrollers and SBCs☆47Updated last year
- Python bindings of speexdsp noise suppression library☆45Updated 3 years ago
- This is a repository for a paper accepted at the 2022 IEEE Spoken Language Technology Workshop (SLT 2022)☆16Updated 3 years ago
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras☆11Updated 5 years ago
- ☆44Updated 3 years ago
- ☆11Updated 4 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- TensorFlow-based wake word detection☆17Updated last month
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR☆68Updated 6 months ago
- Experiments to test different speech recognition systems for SEPIA Framework☆62Updated 2 years ago
- ☆49Updated 3 years ago
- Zero-shot Audio Classification using Whisper☆79Updated 3 years ago
- Mike/Projects/pysilero-vad.git☆22Updated 2 weeks ago
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…☆23Updated 4 years ago
- ☆49Updated 2 years ago
- Python3 code for the IEEE SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"☆12Updated 5 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆26Updated 5 years ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆13Updated 9 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- C++ version of pyannote audio overlapped speech detection pipeline☆13Updated last year
- Add n-gram and large language model (LLM) support to Whisper models.☆39Updated 7 months ago
- NPTEL2020: Speech2Text dataset for Indian-English Accent☆78Updated 4 years ago
- Speech to text library for Rhasspy using Kaldi☆15Updated 2 years ago
- Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".☆29Updated 4 years ago
- A pipeline to isolate and transcribe one language in mixed-language speech☆19Updated 3 years ago
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting☆23Updated 3 years ago
- A set of tools for working with accent data in Mozilla's Common Voice dataset☆14Updated 2 years ago