castorini / howl-deploy
JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voice
☆10 · Updated 4 years ago
Related projects
Alternatives and complementary repositories for howl-deploy
- Web app for keyword spotting using TensorflowJS ☆69 · Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆98 · Updated last year
- 🐸TTS recipes for different datasets ☆84 · Updated 2 years ago
- TurnGPT: a Transformer-based Language Model for Predicting Turn-taking in Spoken Dialog ☆42 · Updated 5 months ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to… ☆42 · Updated 3 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model ☆106 · Updated 3 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆100 · Updated last year
- ☆65 · Updated last year
- Jupyter Notebooks for creating Speech datasets ☆46 · Updated 5 years ago
- A TTS model that makes a speaker speak new languages ☆75 · Updated 4 months ago
- ☆74 · Updated 3 years ago
- A phoneme-allophone database for many languages ☆48 · Updated 4 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆83 · Updated last month
- ☆77 · Updated 5 months ago
- ☆42 · Updated 5 months ago
- Zero-shot Audio Classification using Whisper ☆74 · Updated last year
- Various speech datasets made available to the public ☆98 · Updated last month
- Using speaker embedding for diarization in PyTorch ☆18 · Updated 4 years ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi. ☆26 · Updated 3 months ago
- A collection of utilities for handling IPA phones. ☆24 · Updated last year
- Large scale (>200h) and publicly available read audio book corpus. This corpus is an augmentation of LibriSpeech ASR Corpus (1000h) and c… ☆43 · Updated 2 years ago
- automatically align transcribed audio and generate a wav2letter training corpus ☆34 · Updated last year
- Reproducible experimental protocols for multimedia (audio, video, text) database ☆83 · Updated 3 weeks ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode ☆110 · Updated 2 years ago
- SEPIA server to support open-source speech recognition via WebSocket connection. ☆120 · Updated this week
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p! ☆134 · Updated this week
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI… ☆19 · Updated 2 months ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice. ☆199 · Updated 3 months ago
- ☆33 · Updated 3 years ago
- A lightweight library to compute Diarization Error Rate (DER). ☆59 · Updated last year