castorini / honkling
Web app for keyword spotting using TensorFlow.js
☆73 · Updated 2 years ago
Alternatives and similar repositories for honkling
Users that are interested in honkling are comparing it to the libraries listed below
- 🐸TTS recipes for different datasets ☆86 · Updated 3 years ago
- Tools to create your own voice dataset for TTS training ☆68 · Updated 4 years ago
- Live Audio MFCC Visualization in the browser using Web Audio API - https://pulakk.github.io/Live-Audio-MFCC/tutorial ☆41 · Updated 5 years ago
- DeepSpeech based forced alignment tool ☆239 · Updated 4 years ago
- lyrics-to-audio-alignement system. Initially done using HTK for rapid prototyping ☆14 · Updated 7 years ago
- Forced Alignments for Common Voice ☆31 · Updated 4 years ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice. ☆213 · Updated last year
- Simple text to phonemes converter for multiple languages ☆20 · Updated 2 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆101 · Updated 2 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments ☆102 · Updated 5 years ago
- Speaker diarization python system based on binary key speaker modelling ☆60 · Updated 3 years ago
- JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voice ☆10 · Updated 5 years ago
- Pytorch implementation of Deepmind's WaveRNN model ☆121 · Updated 6 years ago
- 🐸STT integration examples ☆129 · Updated 2 years ago
- 24-hour Automatic Speech Recognition ☆27 · Updated 4 years ago
- Python library for handling audio datasets. ☆138 · Updated 2 years ago
- SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition… ☆98 · Updated 3 years ago
- Labeled data for homograph disambiguation ☆59 · Updated 2 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks ☆65 · Updated 5 years ago
- This repository is a collection of TTS Models in TFLite ☆199 · Updated 4 years ago
- automatically align transcribed audio and generate a wav2letter training corpus ☆36 · Updated 2 years ago
- Implements python programs to train and test a Recurrent Neural Network with Tensorflow ☆72 · Updated 5 years ago
- An even smaller speech recognizer / force aligner ☆35 · Updated 8 months ago
- Python server for communicating with Kaldi from the browser using WebRTC ☆69 · Updated last year
- Deep neural network (DNN) for noise reduction, removal of background music, and speech separation ☆172 · Updated 2 years ago
- Multilingual Grapheme to Phoneme ☆50 · Updated 9 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆118 · Updated 2 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database ☆35 · Updated last year
- On-device voice activity detection (VAD) powered by deep learning ☆227 · Updated 2 weeks ago
- Using Spectral Noise Gating (SNG) techniques to reduce background noise in streaming microphone input for enhanced vocal recognition ☆25 · Updated 6 years ago