rhasspy / rhasspy-silence
Silence detection in audio stream using webrtcvad
☆46 · Updated last year
Alternatives and similar repositories for rhasspy-silence:
Users who are interested in rhasspy-silence are comparing it to the libraries listed below.
- Voice Activity Detection (VAD) using deep learning. ☆193 · Updated 5 years ago
- PyTorch implementation of DeepMind's WaveRNN model ☆121 · Updated 5 years ago
- Official home of the Idlak Speech Synthesis Toolkit ☆66 · Updated 3 years ago
- Wake word engine benchmark framework ☆131 · Updated 3 years ago
- Use your data to create a speech recognition system in Kaldi. Fast. ☆65 · Updated 5 years ago
- 🐸STT integration examples ☆122 · Updated 2 years ago
- Python library for audio augmentation ☆83 · Updated last year
- A simple audio feature extraction library ☆79 · Updated 5 years ago
- Deep Neural Network for Speaker Count Estimation ☆146 · Updated 4 years ago
- Deep understanding and modelling of the hierarchical structure of prosody ☆22 · Updated 5 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆100 · Updated last year
- Adapting your own Language Model for Kaldi ☆64 · Updated 6 years ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory ☆16 · Updated 5 years ago
- ☆254 · Updated 2 years ago
- DeepSpeech based forced alignment tool ☆235 · Updated 4 years ago
- Zero-shot Audio Classification using Whisper ☆77 · Updated 2 years ago
- An HTML interface for finetuning the sync map output from aeneas ☆53 · Updated 2 years ago
- Wav2Keyword is a keyword spotting (KWS) model based on Wav2Vec 2.0. It achieves state-of-the-art results on the Speech Commands datasets V1 and V2. ☆102 · Updated 2 years ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv… ☆129 · Updated last month
- An automatic speech recognition API ☆48 · Updated this week
- Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper ☆142 · Updated last year
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments ☆106 · Updated 9 months ago
- Open tools and data for cloudless automatic speech recognition ☆446 · Updated 3 years ago
- Python server for communicating with Kaldi from the browser using WebRTC ☆68 · Updated last year
- Some simple wrappers around kaldi-asr intended to make using Kaldi's (online) decoders as convenient as possible. ☆170 · Updated 3 years ago
- ☆74 · Updated 3 years ago
- Server framework for Kaldi ASR Toolkit ☆97 · Updated last year
- Desktop application for neural speech synthesis written in C++ ☆212 · Updated last year
- How to create your own model for Vosk ☆65 · Updated 3 years ago
- A Python toolbox for speech features extraction ☆160 · Updated last year