rhasspy / rhasspy-silence
Silence detection in audio stream using webrtcvad
β46Updated 9 months ago
Related projects: β
- πΈSTT integration examplesβ118Updated last year
- How to create your own model for voskβ63Updated 3 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.β65Updated 4 years ago
- Python library for audio augmentationβ83Updated last year
- Model for recasing and repunctuating ASR transcriptsβ126Updated 5 months ago
- A crash course for training speech recognition models using DeepSpeech.β23Updated 3 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages.β272Updated 2 months ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.β100Updated last year
- Pytorch implementation of Deepmind's WaveRNN modelβ120Updated 5 years ago
- β31Updated 2 weeks ago
- Tool to analyze an audio corpora in terms of intonation, intensity, duration and voice qualityβ19Updated 5 years ago
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)β56Updated 4 years ago
- Grapheme To Phonemeβ69Updated last month
- Linguistic processing for Common Voiceβ50Updated 8 months ago
- β75Updated 3 months ago
- A collection of basic python modules for spoken natural language processingβ56Updated 4 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.β81Updated last year
- An automatic speech recognition APIβ40Updated 2 weeks ago
- Data and code for grapheme-to-phoneme transducers in lots of languagesβ128Updated 5 months ago
- A lightweight library to compute Diarization Error Rate (DER).β59Updated last year
- Speaker diarization python system based on binary key speaker modellingβ61Updated 2 years ago
- β38Updated 2 years ago
- Grapheme to phoneme conversion with deep learning.β349Updated 9 months ago
- πΈTTS recipes for different datasetsβ84Updated 2 years ago
- [deprecated] Pretrained models for pyannote-audio 1.xβ70Updated 2 years ago
- speaker diarization system using an LSTMβ49Updated last year
- An online speech recognition extension toolkit of Kaldiβ57Updated 3 years ago
- On-device noise suppression powered by deep learningβ59Updated 2 weeks ago
- DeepSpeech based forced alignment toolβ232Updated 3 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone β¦β41Updated 2 years ago