castorini / howl
Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.
☆194Updated last month
Related projects:
- This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).☆236Updated 2 years ago
- Working online speech recognition based on RNN Transducer. (Trained model available in the releases.)☆288Updated 3 years ago
- Segment an audio file and obtain utterance alignments. (Python package)☆319Updated 4 months ago
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus☆160Updated 2 months ago
- DeepSpeech based forced alignment tool☆232Updated 3 years ago
- Wav2Keyword is a keyword spotting (KWS) model based on Wav2Vec 2.0. It achieves state-of-the-art results on the Speech Commands dataset V1 and V2.☆98Updated last year
- This repository is a collection of TTS Models in TFLite☆186Updated 3 years ago
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation☆508Updated last year
- End-to-end speech recognition using RNN Transducers in Tensorflow 2.0☆241Updated 3 years ago
- A list of publicly available audio data that anyone can download for ASR or other speech activities☆198Updated 3 years ago
- Large, modern dataset for speech recognition☆629Updated 6 months ago
- ☆250Updated last year
- Wake word engine benchmark framework☆132Updated 2 years ago
- Open tools and data for cloudless automatic speech recognition☆443Updated 3 years ago
- Diarization scoring tools.☆213Updated last year
- Speaker embedding (d-vector) trained with GE2E loss☆270Updated 8 months ago
- ESPnet Model Zoo☆242Updated last year
- Variational Bayes HMM over x-vectors diarization☆251Updated 8 months ago
- A tokenizer, text cleaner, and phonemizer for many human languages.☆272Updated 2 months ago
- The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech proc…☆363Updated this week
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆217Updated last month
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.☆170Updated 3 years ago
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems☆184Updated last year
- Tools for Speech Enhancement integrated with Kaldi☆395Updated last year
- On-device voice activity detection (VAD) powered by deep learning☆165Updated 2 weeks ago
- Voice Activity Detection (VAD) using deep learning.☆190Updated 4 years ago
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…☆347Updated this week
- A fast and lightweight Python-based CTC beam search decoder for speech recognition.☆421Updated last year
- In this repository, we explore using a hybrid system consisting of a Convolutional Neural Network and a Support Vector Machine for Keywor…☆90Updated last year
- An End-to-End Architecture for Keyword Spotting and Voice Activity Detection☆373Updated last year