castorini / honklingLinks
Web app for keyword spotting using TensorflowJS
β71Updated 2 years ago
Alternatives and similar repositories for honkling
Users that are interested in honkling are comparing it to the libraries listed below
Sorting:
- πΈTTS recipes for different datasetsβ87Updated 2 years ago
- Pytorch implementation of Deepmind's WaveRNN modelβ119Updated 5 years ago
- Command line tool to create corpora for Common Voiceβ76Updated last year
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environmentsβ102Updated 5 years ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, reβ¦β47Updated 2 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.β102Updated 2 years ago
- Live Audio MFCC Visualization in the browser using Web Audio API - https://pulakk.github.io/Live-Audio-MFCC/tutorialβ41Updated 5 years ago
- DeepSpeech based forced alignment toolβ237Updated 4 years ago
- JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voiceβ10Updated 4 years ago
- An even smaller speech recognizer / force alignerβ33Updated 5 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisperβ114Updated 2 years ago
- 24-hour Automatic Speech Recognitionβ27Updated 4 years ago
- VCTK multi-speaker tacotron for ICASSP 2020β266Updated 3 years ago
- On-device voice activity detection (VAD) powered by deep learningβ216Updated 3 weeks ago
- Speaker diarization python system based on binary key speaker modellingβ61Updated 3 years ago
- Forced Alignments for Common Voiceβ31Updated 4 years ago
- Jupyter Notebooks for creating Speech datasetsβ46Updated 6 years ago
- This repository is a collection of TTS Models in TFLiteβ193Updated 4 years ago
- A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket conneβ¦β216Updated 5 years ago
- Tensorflow Implementation of Expressive Tacotronβ196Updated 6 years ago
- Speaker diarization scripts, based on AaltoASRβ190Updated 6 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Modelβ107Updated 3 years ago
- A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.β29Updated 11 months ago
- speaker diarization system using an LSTMβ50Updated 2 years ago
- SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognitionβ¦β98Updated 3 years ago
- π Coqui's machine learning job schedulerβ32Updated 3 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Yβ¦β25Updated 6 years ago
- Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problemβ51Updated 6 years ago
- Multilingual Grapheme to Phonemeβ49Updated 9 years ago
- Speech-to-text based on wav2letter built for transfer learningβ97Updated 2 years ago