rhasspy / rhasspy-silence
Silence detection in audio stream using webrtcvad
☆46Updated last year
Alternatives and similar repositories for rhasspy-silence:
Users that are interested in rhasspy-silence are comparing it to the libraries listed below
- ☆74Updated 3 years ago
- Pytorch implementation of Deepmind's WaveRNN model☆121Updated 5 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.☆65Updated 5 years ago
- How to create your own model for vosk☆70Updated 3 years ago
- Official home of the Idlak Speech Synthesis Toolkit☆66Updated 3 years ago
- Grapheme To Phoneme☆70Updated 7 months ago
- Python library for audio augmentation☆83Updated last year
- Paper: https://arxiv.org/abs/1702.02285☆63Updated 6 years ago
- Linguistic processing for Common Voice☆55Updated last year
- 🐸STT integration examples☆126Updated 2 years ago
- Zero-shot Audio Classification using Whisper☆80Updated 2 years ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory☆16Updated 6 years ago
- A simple audio feature extraction library☆79Updated 5 years ago
- wake word engine benchmark framework☆133Updated 3 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆41Updated 2 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- ASRecognition: just an easy-to-use library for Automatic Speech Recognition.☆51Updated 2 years ago
- Scripts to simplify data prepping for Mozilla DeepSpeech.☆14Updated 5 years ago
- ☆35Updated last week
- Python server for communicating with Kaldi from the browser using WebRTC☆69Updated last year
- An HTML interface for finetuning the sync map output from aeneas☆53Updated 2 years ago
- Forced Alignments for Common Voice☆31Updated 4 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆109Updated 2 years ago
- This repository is for wake-word detection in speech using recurrent neural networks☆17Updated 6 years ago
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)☆61Updated 4 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 3 years ago
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.☆15Updated 5 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆72Updated 2 years ago
- Multistream CNN for Robust Acoustic Modeling☆40Updated 3 years ago
- Adapting your own Language Model for Kaldi☆64Updated 6 years ago