castorini / howl-deploy
JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voice
☆10Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for howl-deploy
- Web app for keyword spotting using TensorflowJS☆69Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆99Updated last year
- automatically align transcribed audio and generate a wav2letter training corpus☆35Updated last year
- NPTEL2020: Speech2Text dataset for Indian-English Accent☆72Updated 2 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆84Updated last month
- 🐸TTS recipes for different datasets☆84Updated 2 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments☆101Updated 4 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆106Updated 3 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆100Updated last year
- A phoneme-allophone database for many languages☆48Updated 4 years ago
- Speech-to-text based on wav2letter built for transfer learning☆96Updated 2 years ago
- ☆38Updated 2 years ago
- Various speech datasets made available to the public☆99Updated last month
- Coqui Inference Engine☆38Updated 3 years ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.☆200Updated 3 months ago
- On-device voice activity detection (VAD) powered by deep learning☆179Updated this week
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆110Updated 2 years ago
- ☆77Updated 5 months ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆71Updated 2 years ago
- This repository is a collection of TTS Models in TFLite☆189Updated 3 years ago
- Dataset Release for Intent Classification from Speech☆46Updated last year
- DeepSpeech based forced alignment tool☆234Updated 3 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆83Updated last month
- Jupyter Notebooks for creating Speech datasets☆46Updated 5 years ago
- Zero-shot Audio Classification using Whisper☆74Updated last year
- Code for AccentDB.☆19Updated 3 years ago
- Large scale (>200h) and publicly available read audio book corpus. This corpus is an augmentation of LibriSpeech ASR Corpus (1000h) and c…☆43Updated 2 years ago
- asr2k☆48Updated 5 months ago
- Forced Alignments for Common Voice☆31Updated 4 years ago
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆134Updated 10 months ago