castorini / howl-deploy
JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voice
☆10 · Updated 5 years ago
Alternatives and similar repositories for howl-deploy
Users who are interested in howl-deploy are comparing it to the libraries listed below.
- Web app for keyword spotting using TensorflowJS ☆74 · Updated 3 years ago
- Coqui's machine learning job scheduler ☆31 · Updated 4 years ago
- TurnGPT: a Transformer-based Language Model for Predicting Turn-taking in Spoken Dialog ☆63 · Updated last year
- Building blocks for voice-enabled applications in the browser ☆37 · Updated last week
- 🐸TTS recipes for different datasets ☆86 · Updated 3 years ago
- BotSIM - a data-efficient end-to-end Bot SIMulation toolkit for evaluation, diagnosis, and improvement of commercial chatbots ☆116 · Updated 9 months ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments ☆103 · Updated 5 years ago
- Joint speech-language model - respond directly to audio! ☆30 · Updated last year
- On-device voice activity detection (VAD) powered by deep learning ☆242 · Updated 3 weeks ago
- Open TTS models, built for streaming on the edge ☆45 · Updated 10 months ago
- Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation ☆150 · Updated 2 years ago
- Datasets for turn-taking research ☆17 · Updated 2 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stack ☆27 · Updated 3 years ago
- ☆76 · Updated 4 years ago
- Speech-to-text based on wav2letter built for transfer learning ☆98 · Updated 3 years ago
- A model that predicts the punctuation of English, Italian, French and German texts. ☆83 · Updated 2 years ago
- Jupyter Notebooks for creating Speech datasets ☆46 · Updated 6 years ago
- Zero-shot Audio Classification using Whisper ☆79 · Updated 3 years ago
- vad ☆25 · Updated 2 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 · Updated last year
- Consists of the largest (10K) human-annotated code-switched semantic parsing dataset & 170K generated utterances using the CST5 augmentati… ☆41 · Updated 3 years ago
- Conversational AI Benchmark. ☆68 · Updated 2 years ago
- Gecko - A Tool for Effective Annotation of Human Conversations ☆301 · Updated 2 months ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper. ☆16 · Updated 4 years ago
- SEPIA server to support open-source speech recognition via WebSocket connection. ☆135 · Updated last year
- Audio tokenization, in the fastest way possible! ☆53 · Updated last year
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode ☆111 · Updated 3 years ago
- Putting flows on top of neural transducers for better TTS ☆65 · Updated 3 weeks ago
- 🐸STT integration examples ☆130 · Updated 3 years ago
- A Benchmark Dataset for Understanding Disfluencies in Question Answering ☆64 · Updated 4 years ago