dscripka / synthetic_speech_dataset_generation
This repository contains text-to-speech (TTS) models and utilities designed to produce synthetic training datasets for other speech-related models.
☆19 Updated 2 years ago
Alternatives and similar repositories for synthetic_speech_dataset_generation
Users who are interested in synthetic_speech_dataset_generation are comparing it to the libraries listed below.
- Python bindings of speexdsp noise suppression library ☆39 Updated 2 years ago
- An open-source, easily accessible package for training and deploying Speech-to-Intent models on microcontrollers and SBCs ☆44 Updated last year
- Target speaker automatic speech recognition (TS-ASR) ☆11 Updated last year
- Speech to text library for Rhasspy using Kaldi ☆14 Updated last year
- C++ version of pyannote audio overlapped speech detection pipeline ☆13 Updated last year
- ☆12 Updated 4 months ago
- Voice Framework ☆14 Updated 2 months ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech… ☆17 Updated 2 years ago
- Keyword Spotting (KWS) API wrapper for TFLite streaming models. ☆12 Updated 3 years ago
- One command to start a streaming ASR server. ☆12 Updated 8 months ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models. ☆13 Updated 4 years ago
- ☆14 Updated 11 months ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model. ☆12 Updated 2 years ago
- ☆9 Updated 5 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech. ☆13 Updated 2 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition ☆27 Updated 4 years ago
- A pipeline to isolate and transcribe one language in mixed-language speech ☆18 Updated 2 years ago
- ☆15 Updated 2 months ago
- Python3 code for the IEEE SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap" ☆12 Updated 5 years ago
- Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) in Persian language. ☆10 Updated 4 months ago
- Prosodic Speech Segmentation with Transformers ☆25 Updated last year
- Russian phonetical transcription ☆10 Updated last year
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting ☆23 Updated 3 years ago
- Goodness of Pronunciation algorithm using PyKaldi ☆16 Updated 3 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems ☆19 Updated 4 years ago
- The project is related to the development of labs for the ITMO Speaker Recognition Course. ☆10 Updated last month
- Code for the paper: How Much Context Does My Attention-Based ASR System Need? ☆10 Updated last month
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres… ☆21 Updated last year
- ☆11 Updated 3 years ago
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras ☆11 Updated 4 years ago