gooofy / zamia-speech
Open tools and data for cloudless automatic speech recognition
☆447Updated 3 years ago
Alternatives and similar repositories for zamia-speech:
Users who are interested in zamia-speech are comparing it to the libraries listed below:
- This is a list of features, scripts, blogs and resources for better using Kaldi (http://kaldi-asr.org/)☆535Updated 3 years ago
- GStreamer plugin around Kaldi's online neural network decoder☆185Updated 4 years ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.☆170Updated 3 years ago
- Speaker diarization scripts, based on AaltoASR☆190Updated 6 years ago
- DeepSpeech based forced alignment tool☆237Updated 4 years ago
- Scripts for training general-purpose large vocabulary German acoustic models for ASR with Kaldi.☆173Updated last year
- Phonetisaurus G2P☆462Updated 8 months ago
- Dockerfile for kaldi-gstreamer-server.☆289Updated 2 years ago
- Real-time Voice Activity Detection in Noisy Environments using Deep Neural Networks☆434Updated 4 years ago
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.☆376Updated last year
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆467Updated 4 years ago
- Python interface for forced audio alignment using HTK and SoX☆334Updated 4 years ago
- A list of publicly available audio data that anyone can download for ASR or other speech activities☆203Updated 3 years ago
- Voice Activity Detector in Python☆472Updated 4 years ago
- g2p: English Grapheme To Phoneme Conversion☆836Updated 2 years ago
- A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket conne…☆216Updated 4 years ago
- Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framework.☆1,076Updated 8 months ago
- A Python wrapper for Kaldi☆1,006Updated 3 weeks ago
- A python library for working with praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting …☆320Updated last year
- Speech recognition software where the neural net is trained with TensorFlow and GMM training and decoding are done in Kaldi☆170Updated 8 years ago
- Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.☆308Updated 3 years ago
- Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.☆851Updated 3 years ago
- Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time☆339Updated last year
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition☆478Updated 3 years ago
- An End-to-End Architecture for Keyword Spotting and Voice Activity Detection☆375Updated last year
- Tools for Speech Enhancement integrated with Kaldi☆409Updated last year
- Keras Interface for Kaldi ASR☆121Updated 7 years ago
- FastCGI support for Kaldi ASR☆185Updated 5 years ago
- End-2-end speech synthesis with recurrent neural networks☆226Updated 11 months ago
- This is a github repository of the abandonware Sequitur G2P by Bisani & Ney☆157Updated 7 months ago