daanzu / speech-training-recorder
Simple GUI application to help record audio dictated from given text prompts, for use with training speech recognition or speech synthesis.
☆41 Updated 4 years ago
Alternatives and similar repositories for speech-training-recorder
Users who are interested in speech-training-recorder are comparing it to the repositories listed below.
- Python wrapper for the rnnoise library ☆49 Updated 2 years ago
- Scripts to align a given wave file to its transcription using models trained with Kaldi ☆34 Updated 6 years ago
- Working online speech recognition based on RNN Transducer. (A trained model is available in the releases.) ☆293 Updated 4 years ago
- ☆92 Updated last year
- Online streaming speaker change detection model in PyTorch ☆43 Updated 2 years ago
- A Python library for speech enhancement ☆81 Updated last year
- Wav2Keyword is a keyword spotting (KWS) model based on Wav2Vec 2.0. It achieves state-of-the-art results on the Speech Commands dataset V1 and V2. ☆108 Updated 2 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆107 Updated 2 years ago
- An online speech recognition extension toolkit for Kaldi ☆56 Updated 4 years ago
- Python server for communicating with Kaldi from the browser using WebRTC ☆69 Updated 2 years ago
- PyTorch-based speech enhancement toolkit. ☆336 Updated last year
- ☆69 Updated last month
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice. ☆214 Updated last year
- Reproducible experimental protocols for multimedia (audio, video, text) database ☆107 Updated 2 months ago
- NPTEL2020: Speech2Text dataset for Indian-English Accent ☆77 Updated 3 years ago
- ☆65 Updated last year
- Clustering-based methods for overlapping diarization ☆81 Updated last year
- SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition… ☆99 Updated 3 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆119 Updated 2 years ago
- On-device voice activity detection (VAD) powered by deep learning ☆233 Updated last week
- Python package for combining diarization system outputs. ☆90 Updated 2 years ago
- Apply the DTLN machine learning model for noise suppression and acoustic echo cancellation on Raspberry Pi ☆76 Updated 3 years ago
- ☆43 Updated last year
- Real-time speech enhancement mobile app using Nested U-Net ☆53 Updated 2 years ago
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma… ☆21 Updated 3 years ago
- Spot the conversation: speaker diarisation in the wild ☆153 Updated 3 years ago
- Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time ☆345 Updated 2 weeks ago
- ☆91 Updated 2 weeks ago
- A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources. ☆67 Updated 6 years ago
- Long audio alignment using Kaldi ☆23 Updated 4 years ago