castorini / howl-deployLinks
JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voice
β10Updated 5 years ago
Alternatives and similar repositories for howl-deploy
Users that are interested in howl-deploy are comparing it to the libraries listed below
Sorting:
- Web app for keyword spotting using TensorflowJSβ74Updated 2 years ago
- πΈTTS recipes for different datasetsβ86Updated 3 years ago
- Buildings block for voice-enabled applications in the browserβ37Updated 5 months ago
- On-device voice activity detection (VAD) powered by deep learningβ229Updated 2 weeks ago
- Zero-shot Audio Classification using Whisperβ78Updated 2 years ago
- π Coqui's machine learning job schedulerβ32Updated 4 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environmentsβ103Updated 5 years ago
- SEPIA server to support open-source speech recognition via WebSocket connection.β131Updated 11 months ago
- TurnGPT: a Transformer-based Language Model for Predicting Turn-taking in Spoken Dialogβ58Updated last year
- Speech-to-text based on wav2letter built for transfer learningβ98Updated 2 years ago
- automatically align transcribed audio and generate a wav2letter training corpusβ36Updated 2 years ago
- Audio tokenization, in the fastest way possible!β53Updated last year
- wav2vec2 asr with transformersβ16Updated 3 years ago
- Jupyter Notebooks for creating Speech datasetsβ46Updated 6 years ago
- A model that predicts the punctuation of English, Italian, French and German texts.β80Updated 2 years ago
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translationβ149Updated last year
- Simple text to phonemes converter for multiple languagesβ20Updated 2 years ago
- β19Updated 7 months ago
- Speaker Diarization with Transformersβ69Updated 4 months ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.β16Updated 4 years ago
- Code for AccentDB.β23Updated 4 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Modelβ107Updated 4 years ago
- πΈSTT integration examplesβ129Updated 3 years ago
- STT Service based on Kaldi ASRβ15Updated 7 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) databaseβ107Updated 3 weeks ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speechβ¦β17Updated 2 years ago
- π€ Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillationβ258Updated 2 years ago
- Automatically constructing corpus for automatic speech recognition from YouTube videosβ154Updated 5 years ago
- Conversational AI Benchmark.β68Updated 2 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.β137Updated 2 years ago