rhasspy / rhasspy-silenceLinks
Silence detection in audio stream using webrtcvad
β49Updated 2 years ago
Alternatives and similar repositories for rhasspy-silence
Users that are interested in rhasspy-silence are comparing it to the libraries listed below
Sorting:
- πΈSTT integration examplesβ130Updated 3 years ago
- Pytorch implementation of Deepmind's WaveRNN modelβ123Updated 6 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.β106Updated 2 years ago
- Official home of the Idlak Speech Synthesis Toolkitβ67Updated 4 years ago
- β76Updated 4 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environmentsβ103Updated 5 years ago
- Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".β29Updated 4 years ago
- Model for recasing and repunctuating ASR transcriptsβ143Updated last year
- automatically align transcribed audio and generate a wav2letter training corpusβ36Updated 2 years ago
- πΈTTS recipes for different datasetsβ86Updated 3 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages.β332Updated last year
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphoneβ35Updated 3 years ago
- An HTML interface for finetuning the sync map output from aeneasβ53Updated 3 years ago
- Adapting your own Language Model for Kaldiβ63Updated 7 years ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language modelβ33Updated 6 years ago
- Multilingual Grapheme to Phonemeβ50Updated 9 years ago
- Support tools for punctuation and boundary detection for ASR output.β55Updated 3 years ago
- wake word engine benchmark frameworkβ150Updated 3 weeks ago
- Use your data to create a speech recognition system in Kaldi. Fast.β65Updated 6 years ago
- ASRecognition: just an easy-to-use library for Automatic Speech Recognition.β51Updated 2 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Modelβ107Updated 4 years ago
- speaker diarization system using an LSTMβ50Updated 3 years ago
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterancesβ50Updated last year
- Spoken Language Identification on Common Voice and AudioSet using Deep Learningβ41Updated 3 years ago
- Neural HMMs are all you need (for high-quality attention-free TTS)β164Updated last week
- Deep neural network (DNN) for noise reduction, removal of background music, and speech separationβ173Updated 3 years ago
- β263Updated 3 years ago
- Zero-shot Audio Classification using Whisperβ79Updated 3 years ago
- Various algorithms for voice activity detectionβ22Updated 9 years ago
- Python library for handling audio datasets.β138Updated 2 years ago