castorini / honkling
Web app for keyword spotting using TensorFlow.js
☆71Updated 2 years ago
Alternatives and similar repositories for honkling:
Users who are interested in honkling are comparing it to the libraries listed below
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated last year
- JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voice☆10Updated 4 years ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.☆205Updated 9 months ago
- 🐸TTS recipes for different datasets☆87Updated 2 years ago
- DeepSpeech based forced alignment tool☆237Updated 4 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments☆102Updated 4 years ago
- SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition…☆98Updated 3 years ago
- On-device voice activity detection (VAD) powered by deep learning☆208Updated this week
- Simple text to phonemes converter for multiple languages☆20Updated 2 years ago
- A list of publicly available audio data that anyone can download for ASR or other speech activities☆209Updated 3 years ago
- Scripts to simplify data prepping for Mozilla DeepSpeech.☆14Updated 5 years ago
- 🐸STT integration examples☆127Updated 2 years ago
- Tool for creation, manipulation and maintenance of voice corpora☆81Updated last year
- Building blocks for voice-enabled applications in the browser☆37Updated 2 weeks ago
- An even smaller speech recognizer / force aligner☆32Updated 4 months ago
- Speaker diarization Python system based on binary key speaker modelling☆61Updated 3 years ago
- Speaker diarization system using an LSTM☆50Updated 2 years ago
- Live Audio MFCC Visualization in the browser using Web Audio API - https://pulakk.github.io/Live-Audio-MFCC/tutorial☆41Updated 5 years ago
- 24-hour Automatic Speech Recognition☆27Updated 3 years ago
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus☆170Updated 5 months ago
- Wav2Keyword is a keyword spotting (KWS) model based on Wav2Vec 2.0. It shows state-of-the-art results on the Speech Commands dataset V1 and V2.☆106Updated 2 years ago
- Speaker diarization scripts, based on AaltoASR☆190Updated 6 years ago
- Forced Alignments for Common Voice☆31Updated 4 years ago
- Extract frequency, power, width and dissonance of formants from wav files☆25Updated 2 years ago
- Python server for communicating with Kaldi from the browser using WebRTC☆69Updated last year
- Python library for handling audio datasets.☆137Updated last year
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆102Updated 2 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆112Updated 2 years ago
- PyTorch implementation of DeepMind's WaveRNN model☆121Updated 5 years ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆248Updated 9 months ago