castorini / howl-deploy
JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voice
β10Updated 4 years ago
Alternatives and similar repositories for howl-deploy:
Users that are interested in howl-deploy are comparing it to the libraries listed below
- Web app for keyword spotting using TensorflowJSβ71Updated 2 years ago
- πΈTTS recipes for different datasetsβ86Updated 2 years ago
- Tunable pipelinesβ31Updated last month
- Various speech datasets made available to the publicβ114Updated 3 months ago
- Simple text to phonemes converter for multiple languagesβ20Updated 2 years ago
- TurnGPT: a Transformer-based Language Model for Predicting Turn-taking in Spoken Dialogβ48Updated 10 months ago
- β80Updated 10 months ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Modelβ107Updated 3 years ago
- OpenAI Whisper Prompt Examplesβ52Updated last year
- Dataset Release for Intent Classification from Speechβ46Updated last month
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text toβ¦β44Updated 3 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networksβ64Updated 4 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) databaseβ98Updated last month
- Code for AccentDB.β20Updated 3 years ago
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translationβ143Updated last year
- Audio tokenization, in the fastest way possible!β49Updated 6 months ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.β203Updated 8 months ago
- Tool for creation, manipulation and maintenance of voice corporaβ81Updated 10 months ago
- Coqui Inference Engineβ38Updated 3 years ago
- Large scale (>200h) and publicly available read audio book corpus. This corpus is an augmentation of LibriSpeech ASR Corpus (1000h) and cβ¦β43Updated 2 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisperβ109Updated 2 years ago
- β74Updated last year
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environmentsβ102Updated 4 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.β101Updated 2 years ago
- A python package for whisper normalizerβ53Updated 3 weeks ago
- A phoneme-allophone database for many languagesβ51Updated 4 years ago
- Speaker Diarization with Transformersβ64Updated 10 months ago
- automatically align transcribed audio and generate a wav2letter training corpusβ36Updated last year
- β74Updated 3 years ago
- A lightweight library to compute Diarization Error Rate (DER).β59Updated last year