daanzu / speech-training-recorder
Simple GUI application to help record audio dictated from given text prompts, for use with training speech recognition or speech synthesis.
☆41 Updated 3 years ago
Alternatives and similar repositories for speech-training-recorder
Users who are interested in speech-training-recorder compare it to the repositories listed below.
- STT Service based on Kaldi ASR ☆15 Updated 6 years ago
- python wrapper for rnnoise library ☆48 Updated 2 years ago
- Scripts to simplify data prepping for Mozilla DeepSpeech. ☆14 Updated 5 years ago
- ☆17 Updated 4 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆116 Updated 2 years ago
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024 ☆22 Updated 7 months ago
- Working online speech recognition based on RNN Transducer. (Trained model release available in release) ☆293 Updated 3 years ago
- Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies ☆15 Updated 7 months ago
- An unofficial PyTorch implementation of the StreamVC (Real-Time Low-Latency Voice Conversion) ☆123 Updated 11 months ago
- Pytorch based speech enhancement toolkit. ☆337 Updated last year
- Improving the Goodness of Pronunciation with DNNs and RNNs ☆32 Updated 6 years ago
- 🐸STT integration examples ☆129 Updated 2 years ago
- Multilingual Grapheme to Phoneme ☆50 Updated 9 years ago
- Online streaming speaker change detection model in Pytorch ☆40 Updated 2 years ago
- 🐸TTS recipes for different datasets ☆86 Updated 2 years ago
- wake word spotting with kaldi ☆19 Updated 4 years ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition ☆15 Updated last year
- Goodness of Pronunciation using Kaldi on Epa-DB database ☆35 Updated last year
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech… ☆17 Updated 2 years ago
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level. ☆25 Updated 2 years ago
- ☆25 Updated 3 years ago
- Pytorch implementation of Deepmind's WaveRNN model ☆121 Updated 6 years ago
- Apply machine learning model DTLN for noise suppression and acoustic echo cancellation on Raspberry Pi ☆67 Updated 3 years ago
- Desktop application for neural speech synthesis written in C++ ☆215 Updated 2 years ago
- Text frontend for ESPnet tts recipes ☆34 Updated 4 years ago
- ☆10 Updated 2 years ago
- A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation. ☆29 Updated last year
- scripts to align a given wave to its transcription using trained models by Kaldi ☆32 Updated 5 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks ☆65 Updated 5 years ago
- Add n-gram and large language model (LLM) support to Whisper models. ☆29 Updated 2 months ago