castorini / honklingLinks
Web app for keyword spotting using TensorflowJS
β74Updated 3 years ago
Alternatives and similar repositories for honkling
Users that are interested in honkling are comparing it to the libraries listed below
Sorting:
- πΈTTS recipes for different datasetsβ86Updated 3 years ago
- Speaker diarization scripts, based on AaltoASRβ191Updated 6 years ago
- speaker diarization system using an LSTMβ50Updated 2 years ago
- DeepSpeech based forced alignment toolβ239Updated 4 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB databaseβ35Updated last year
- Forced Alignments for Common Voiceβ31Updated 5 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.β107Updated 2 years ago
- This repository is a collection of TTS Models in TFLiteβ201Updated 4 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environmentsβ103Updated 5 years ago
- Pytorch implementation of Deepmind's WaveRNN modelβ122Updated 6 years ago
- Speaker diarization python system based on binary key speaker modellingβ60Updated 3 years ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.β214Updated last year
- Command line tool to create corpora for Common Voiceβ78Updated last week
- On-device voice activity detection (VAD) powered by deep learningβ235Updated this week
- SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognitionβ¦β99Updated 3 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networksβ65Updated 5 years ago
- π Coqui's machine learning job schedulerβ31Updated 4 years ago
- automatically align transcribed audio and generate a wav2letter training corpusβ36Updated 2 years ago
- Live Audio MFCC Visualization in the browser using Web Audio API - https://pulakk.github.io/Live-Audio-MFCC/tutorialβ41Updated 5 years ago
- Deep neural network (DNN) for noise reduction, removal of background music, and speech separationβ173Updated 3 years ago
- Speech-to-text based on wav2letter built for transfer learningβ98Updated 3 years ago
- πΈSTT integration examplesβ129Updated 3 years ago
- A recipe for creating a Speaker Identification system built on Kaldi.β15Updated 5 years ago
- Python library for handling audio datasets.β138Updated 2 years ago
- A tool for automatic phoneme transcriptionβ159Updated 2 years ago
- 24-hour Automatic Speech Recognitionβ27Updated 4 years ago
- Python server for communicating with Kaldi from the browser using WebRTCβ69Updated 2 years ago
- Jupyter Notebooks for creating Speech datasetsβ46Updated 6 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisperβ119Updated 2 years ago
- A list of publically available audio data that anyone can download for ASR or other speech activitiesβ232Updated 4 years ago