castorini / howl-deployLinks
JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voice
β10Updated 5 years ago
Alternatives and similar repositories for howl-deploy
Users that are interested in howl-deploy are comparing it to the libraries listed below
Sorting:
- πΈTTS recipes for different datasetsβ86Updated 3 years ago
 - Web app for keyword spotting using TensorflowJSβ74Updated 2 years ago
 - π Coqui's machine learning job schedulerβ31Updated 4 years ago
 - TurnGPT: a Transformer-based Language Model for Predicting Turn-taking in Spoken Dialogβ59Updated last year
 - Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translationβ151Updated last year
 - Jupyter Notebooks for creating Speech datasetsβ46Updated 6 years ago
 - On-device voice activity detection (VAD) powered by deep learningβ232Updated last month
 - BotSIM - a data-efficient end-to-end Bot SIMulation toolkit for evaluation, diagnosis, and improvement of commercial chatbotsβ115Updated 6 months ago
 - β76Updated 4 years ago
 - A Benchmark Dataset for Understanding Disfluencies in Question Answeringβ64Updated 4 years ago
 - A repo listing known open source voice tools, ordered by where they sit in the voice stackβ27Updated 3 years ago
 - Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.β214Updated last year
 - πΈSTT integration examplesβ129Updated 3 years ago
 - π€ Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillationβ259Updated 2 years ago
 - SEPIA server to support open-source speech recognition via WebSocket connection.β132Updated 11 months ago
 - OpenAI Whisper Prompt Examplesβ52Updated 2 years ago
 - Conversational AI Benchmark.β68Updated 2 years ago
 - PyTorch code for end-to-end spoken language understanding (SLU) with ASR-based transfer learningβ228Updated 4 years ago
 - A lightweight library to compute Diarization Error Rate (DER).β62Updated 2 years ago
 - Automatically constructing corpus for automatic speech recognition from YouTube videosβ155Updated 5 years ago
 - Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.β137Updated 2 years ago
 - Code for AccentDB.β23Updated 4 years ago
 - Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecodeβ111Updated 3 years ago
 - The demo page of UniAudioβ34Updated last year
 - A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.β107Updated 2 years ago
 - Datasets for turn-taking researchβ15Updated last year
 - Zero-shot Audio Classification using Whisperβ78Updated 2 years ago
 - Open TTS models, built for streaming on the edgeβ43Updated 7 months ago
 - Voice Activity Projection Models: Self-supervised learning of Turn-taking Eventsβ80Updated last year
 - Reproducible experimental protocols for multimedia (audio, video, text) databaseβ107Updated last month