castorini / howl-deploy
JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voice
☆10 Updated 4 years ago
Alternatives and similar repositories for howl-deploy
Users that are interested in howl-deploy are comparing it to the libraries listed below
- TurnGPT: a Transformer-based Language Model for Predicting Turn-taking in Spoken Dialog ☆55 Updated last year
- 🐸TTS recipes for different datasets ☆86 Updated 2 years ago
- Web app for keyword spotting using TensorflowJS ☆72 Updated 2 years ago
- Speaker Diarization with Transformers ☆68 Updated last month
- Coqui's machine learning job scheduler ☆32 Updated 3 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆100 Updated 9 months ago
- OpenAI Whisper Prompt Examples ☆52 Updated last year
- Dataset Release for Intent Classification from Speech ☆47 Updated 4 months ago
- Code for AccentDB. ☆22 Updated 4 years ago
- Joint speech-language model - respond directly to audio! ☆30 Updated last year
- The Gridspace-Stanford Harper Valley speech dataset. Created in support of CS224S. ☆46 Updated 4 years ago
- Proof of concept conversation orchestrator with a speech-language model ☆20 Updated 8 months ago
- Reproducible experimental protocols for multimedia (audio, video, text) database ☆104 Updated 5 months ago
- Datasets for turn-taking research ☆14 Updated last year
- Jupyter Notebooks for creating Speech datasets ☆46 Updated 6 years ago
- Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation ☆147 Updated last year
- Add n-gram and large language model (LLM) support to Whisper models. ☆29 Updated 2 months ago
- NPTEL2020: Speech2Text dataset for Indian-English Accent ☆77 Updated 3 years ago
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax. ☆36 Updated 2 years ago
- Open TTS models, built for streaming on the edge ☆43 Updated 4 months ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to… ☆45 Updated 4 years ago
- Conversational AI Benchmark. ☆68 Updated 2 years ago
- ☆16 Updated 4 months ago
- Coqui Inference Engine ☆40 Updated 3 years ago
- Putting flows on top of neural transducers for better TTS ☆62 Updated 3 weeks ago
- Speech-to-text based on wav2letter built for transfer learning ☆97 Updated 2 years ago
- Audio tokenization, in the fastest way possible! ☆52 Updated 10 months ago
- Google's SoundStorm: Efficient Parallel Audio Generation ☆132 Updated last year
- BotSIM - a data-efficient end-to-end Bot SIMulation toolkit for evaluation, diagnosis, and improvement of commercial chatbots ☆116 Updated 2 months ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments ☆102 Updated 5 years ago