daanzu / speech-training-recorderLinks
Simple GUI application to help record audio dictated from given text prompts, for use with training speech recognition or speech synthesis.
☆41Updated 4 years ago
Alternatives and similar repositories for speech-training-recorder
Users that are interested in speech-training-recorder are comparing it to the libraries listed below
Sorting:
- ☆75Updated 3 months ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆107Updated 2 years ago
- An online speech recognition extension toolkit of Kaldi☆56Updated 4 years ago
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…☆21Updated 3 years ago
- Python server for communicating with Kaldi from the browser using WebRTC☆69Updated 2 years ago
- Working online speech recognition based on RNN Transducer. ( Trained model release available in release )☆293Updated 4 years ago
- Online streaming speaker change detection model in Pytorch☆44Updated 2 years ago
- Apply machine learning model DTLN for noise suppression and acoustic echo cancellation on Raspberry Pi☆83Updated 4 years ago
- python wrapper for rnnoise library☆49Updated 3 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Updated 2 years ago
- Real-time speech enhancement mobile app using Nested U-Net☆54Updated 2 years ago
- Pytorch based speech enhancement toolkit.☆337Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆153Updated last year
- Keyword Search Recipe for Subword ASR☆30Updated 6 years ago
- A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.☆90Updated 9 months ago
- Goodness of Pronunciation algorithm using PyKaldi☆18Updated 3 years ago
- a python library for speech enhancement☆82Updated last year
- Pytorch implementation of Deepmind's WaveRNN model☆123Updated 6 years ago
- scripts to align a given wave to its transcription using trained models by Kaldi☆35Updated 6 years ago
- ☆13Updated 4 years ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆33Updated 2 years ago
- On-device voice activity detection (VAD) powered by deep learning☆241Updated last week
- ☆66Updated last year
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆21Updated 3 years ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection☆103Updated 9 months ago
- Speaker change detection using SincNet and an LSTM/Transformer☆56Updated 7 months ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆68Updated 4 years ago
- ☆41Updated 4 years ago
- Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations☆38Updated 2 years ago
- Long audio alignment using Kaldi☆23Updated 4 years ago