rhasspy / rhasspy-silenceLinks
Silence detection in audio stream using webrtcvad
β48Updated last year
Alternatives and similar repositories for rhasspy-silence
Users that are interested in rhasspy-silence are comparing it to the libraries listed below
Sorting:
- Pytorch implementation of Deepmind's WaveRNN modelβ121Updated 6 years ago
- πΈSTT integration examplesβ130Updated 2 years ago
- An HTML interface for finetuning the sync map output from aeneasβ53Updated 3 years ago
- Model for recasing and repunctuating ASR transcriptsβ136Updated last year
- β76Updated 3 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.β65Updated 5 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.β101Updated 2 years ago
- python wrapper for rnnoise libraryβ48Updated 2 years ago
- Adapting your own Language Model for Kaldiβ63Updated 6 years ago
- Speaker diarization python system based on binary key speaker modellingβ60Updated 3 years ago
- Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".β27Updated 3 years ago
- wake word engine benchmark frameworkβ139Updated 3 years ago
- Advanced data structures for handling temporal segments with attached labels.β114Updated 6 months ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of codeβ149Updated last year
- Some simple wrappers around eSpeak NG intended to make using this excellent TTS for waveform and IPA generation as convenient as possibleβ¦β42Updated 10 months ago
- Multistream CNN for Robust Acoustic Modelingβ40Updated 4 years ago
- Official home of the Idlak Speech Synthesis Toolkitβ66Updated 3 years ago
- Grapheme To Phonemeβ73Updated last year
- Tools to create your own voice dataset for TTS trainingβ67Updated 4 years ago
- Spoken Language Identification on Common Voice and AudioSet using Deep Learningβ40Updated 3 years ago
- speaker diarization system using an LSTMβ50Updated 2 years ago
- Multilingual Grapheme to Phonemeβ50Updated 9 years ago
- A lightweight library to compute Diarization Error Rate (DER).β60Updated last year
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environmentsβ102Updated 5 years ago
- πΈTTS recipes for different datasetsβ86Updated 3 years ago
- ESPnet Model Zooβ255Updated 2 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages.β321Updated 8 months ago
- DeepSpeech based forced alignment toolβ238Updated 4 years ago
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speechβ229Updated 3 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation suppβ¦β48Updated 2 years ago