daanzu / speech-training-recorder
Simple GUI application to help record audio dictated from given text prompts, for use with training speech recognition or speech synthesis.
☆40 Updated 3 years ago
Alternatives and similar repositories for speech-training-recorder:
Users who are interested in speech-training-recorder are comparing it to the libraries listed below:
- PyTorch implementation of DeepMind's WaveRNN model ☆121 Updated 5 years ago
- Automatically align transcribed audio and generate a wav2letter training corpus ☆36 Updated last year
- STT Service based on Kaldi ASR ☆15 Updated 6 years ago
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma… ☆20 Updated 3 years ago
- A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation. ☆28 Updated 9 months ago
- 🐸STT integration examples ☆126 Updated 2 years ago
- This repository is for wake-word detection in speech using recurrent neural networks ☆17 Updated 6 years ago
- Deep Convolution Text to Speech ☆35 Updated 7 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆109 Updated 2 years ago
- Long audio alignment using Kaldi ☆24 Updated 3 years ago
- 🐸TTS recipes for different datasets ☆86 Updated 2 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆94 Updated 5 months ago
- Forced Alignments for Common Voice ☆31 Updated 4 years ago
- On-device voice activity detection (VAD) powered by deep learning ☆202 Updated this week
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks ☆64 Updated 4 years ago
- SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition… ☆98 Updated 2 years ago
- Working online speech recognition based on RNN Transducer (trained model available in the releases) ☆291 Updated 3 years ago
- ☆17 Updated last year
- A Python library for speech enhancement ☆78 Updated 8 months ago
- ♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy). ☆81 Updated 9 months ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ☆135 Updated last year
- Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time ☆340 Updated last year
- Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no Python package dependencies ☆10 Updated 3 months ago
- A free & open tool for transcribing audio interviews with offline ASR support ☆24 Updated last year
- Docker images for Coqui AI ☆57 Updated 3 years ago
- Persian Consonant Vowel Combination (PCVC) Speech Dataset ☆19 Updated 4 years ago
- Scripts to align a given wave to its transcription using trained models by Kaldi ☆32 Updated 5 years ago
- BurrMill core ☆21 Updated 3 years ago
- Use your data to create a speech recognition system in Kaldi. Fast. ☆65 Updated 5 years ago
- NPTEL2020: Speech2Text dataset for Indian-English Accent ☆75 Updated 3 years ago