castorini / howlLinks
Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.
☆209Updated 11 months ago
Alternatives and similar repositories for howl
Users that are interested in howl are comparing it to the libraries listed below
Sorting:
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus☆175Updated 7 months ago
- This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).☆260Updated 3 years ago
- Voice Activity Detection based on Deep Learning & TensorFlow☆367Updated 2 years ago
- Large, modern dataset for speech recognition☆678Updated last year
- Grapheme to phoneme conversion with deep learning.☆389Updated last year
- Diarization scoring tools.☆250Updated 2 years ago
- Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.☆107Updated 2 years ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )☆537Updated 3 years ago
- Segment an audio file and obtain utterance alignments. (Python package)☆337Updated last year
- Open tools and data for cloudless automatic speech recognition☆446Updated 4 years ago
- Tools for Speech Enhancement integrated with Kaldi☆415Updated 2 years ago
- DeepSpeech based forced alignment tool☆238Updated 4 years ago
- 🐸STT integration examples☆129Updated 2 years ago
- Speaker embedding (d-vector) trained with GE2E loss☆282Updated last year
- Towards hot directions in industrial end to end speech recognition☆325Updated 3 years ago
- Variational Bayes HMM over x-vectors diarization☆272Updated last year
- On-device voice activity detection (VAD) powered by deep learning☆219Updated this week
- The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech proc…☆370Updated 2 weeks ago
- A tokenizer, text cleaner, and phonemizer for many human languages.☆317Updated 7 months ago
- Working online speech recognition based on RNN Transducer. ( Trained model release available in release )☆293Updated 3 years ago
- Python server for communicating with Kaldi from the browser using WebRTC☆69Updated last year
- This repository is a collection of TTS Models in TFLite☆195Updated 4 years ago
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…☆422Updated 3 months ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆251Updated 11 months ago
- Kaldi model converter to ONNX☆244Updated 2 years ago
- Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised …☆135Updated last year
- End-to-end speech recognition using RNN Transducers in Tensorflow 2.0☆246Updated last week
- UniSpeech - Large Scale Self-Supervised Learning for Speech☆464Updated last year
- An End-to-End Architecture for Keyword Spotting and Voice Activity Detection☆380Updated 2 years ago
- Tools for handling multimodal data in machine learning projects.☆1,036Updated 3 weeks ago