castorini / howl
Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.
☆209Updated 11 months ago
Alternatives and similar repositories for howl
Users who are interested in howl are comparing it to the libraries listed below.
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus☆175Updated 7 months ago
- This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection) resources.☆260Updated 3 years ago
- Wav2Keyword is a keyword spotting (KWS) model based on Wav2Vec 2.0. It achieves state-of-the-art results on the Speech Commands dataset V1 and V2.☆107Updated 2 years ago
- Large, modern dataset for speech recognition☆678Updated last year
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆251Updated 11 months ago
- ONNX wrapper for ESPnet inference model☆164Updated 8 months ago
- On-device voice activity detection (VAD) powered by deep learning☆219Updated this week
- Working online speech recognition based on RNN Transducer. (Trained model available in release.)☆293Updated 3 years ago
- Tools for Speech Enhancement integrated with Kaldi☆415Updated last year
- A list of publicly available audio data that anyone can download for ASR or other speech activities☆210Updated 3 years ago
- Voice Activity Detection based on Deep Learning & TensorFlow☆367Updated 2 years ago
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…☆422Updated 3 months ago
- Phonetisaurus G2P☆482Updated last year
- PyTorch implementations of neural network models for keyword spotting☆517Updated 2 years ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.☆170Updated 4 years ago
- Python server for communicating with Kaldi from the browser using WebRTC☆69Updated last year
- The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech proc…☆370Updated 2 weeks ago
- Voice Activity Detection (VAD) using deep learning.☆196Updated 5 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages.☆317Updated 7 months ago
- 🐸STT integration examples☆129Updated 2 years ago
- DeepSpeech based forced alignment tool☆238Updated 4 years ago
- ESPnet Model Zoo☆254Updated last year
- This repository is a collection of TTS Models in TFLite☆195Updated 4 years ago
- In this repository, we explore using a hybrid system consisting of a Convolutional Neural Network and a Support Vector Machine for Keywor…☆102Updated 2 years ago
- Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - P…☆202Updated this week
- Segment an audio file and obtain utterance alignments. (Python package)☆337Updated last year
- Open tools and data for cloudless automatic speech recognition☆446Updated 4 years ago
- Tools for handling multimodal data in machine learning projects.☆1,036Updated 3 weeks ago
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation☆542Updated 2 years ago
- Speaker embedding (d-vector) trained with GE2E loss☆282Updated last year