castorini / howl
Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.
☆205 Updated 9 months ago
Alternatives and similar repositories for howl
Users who are interested in howl compare it to the libraries listed below.
- This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection). ☆258 Updated 2 years ago
- Wav2Keyword is keyword spotting (KWS) based on Wav2Vec 2.0. This model achieves state-of-the-art results on the Speech Commands dataset V1 and V2. ☆107 Updated 2 years ago
- Large, modern dataset for speech recognition ☆674 Updated last year
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus ☆172 Updated 5 months ago
- On-device voice activity detection (VAD) powered by deep learning ☆214 Updated last week
- Grapheme to phoneme conversion with deep learning. ☆382 Updated last year
- Wake word engine benchmark framework ☆135 Updated 3 years ago
- Segment an audio file and obtain utterance alignments. (Python package) ☆335 Updated last year
- The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech proc… ☆365 Updated 5 months ago
- 🐸STT integration examples ☆126 Updated 2 years ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible. ☆170 Updated 4 years ago
- An End-to-End Architecture for Keyword Spotting and Voice Activity Detection ☆379 Updated 2 years ago
- ESPnet Model Zoo ☆250 Updated last year
- Open tools and data for cloudless automatic speech recognition ☆447 Updated 4 years ago
- This repository is a collection of TTS Models in TFLite ☆192 Updated 4 years ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv… ☆139 Updated this week
- Code for Temporal Convolution for Real-time Keyword Spotting on Mobile Devices ☆223 Updated 2 years ago
- Tools for Speech Enhancement integrated with Kaldi ☆413 Updated last year
- ☆258 Updated 2 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages. ☆310 Updated 6 months ago
- ☆123 Updated 4 years ago
- PyTorch implementations of neural network models for keyword spotting ☆516 Updated last year
- Working online speech recognition based on RNN Transducer. (Trained model available in the releases.) ☆293 Updated 3 years ago
- PyTorch Implementation of FastSpeech 2: Fast and High-Quality End-to-End Text to Speech ☆230 Updated 2 years ago
- Official implementation of the Keyword Transformer: https://arxiv.org/abs/2104.00769 ☆130 Updated 3 years ago
- A library for speech data augmentation in time-domain ☆661 Updated 3 years ago
- DeepSpeech based forced alignment tool ☆237 Updated 4 years ago
- End-to-end speech recognition using RNN Transducers in Tensorflow 2.0 ☆245 Updated 4 years ago
- Speaker embedding (d-vector) trained with GE2E loss ☆282 Updated last year
- Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support. ☆617 Updated last year