gooofy / zamia-speech
Open tools and data for cloudless automatic speech recognition
☆444 Updated 3 years ago
Alternatives and similar repositories for zamia-speech:
Users who are interested in zamia-speech are comparing it to the libraries listed below.
- Scripts for training general-purpose large vocabulary German acoustic models for ASR with Kaldi. ☆173 Updated last year
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible. ☆170 Updated 3 years ago
- This is a list of features, scripts, blogs and resources for better using Kaldi (http://kaldi-asr.org/) ☆533 Updated 2 years ago
- GStreamer plugin around Kaldi's online neural network decoder ☆185 Updated 4 years ago
- Phonetisaurus G2P ☆453 Updated 6 months ago
- Dockerfile for kaldi-gstreamer-server. ☆288 Updated 2 years ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages ☆466 Updated 4 years ago
- Speaker diarization scripts, based on AaltoASR ☆191 Updated 5 years ago
- DeepSpeech based forced alignment tool ☆235 Updated 3 years ago
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python. ☆376 Updated last year
- Real-time Voice Activity Detection in Noisy Environments using Deep Neural Networks ☆430 Updated 4 years ago
- Offline transcription system for Estonian using Kaldi ☆226 Updated 2 years ago
- Python interface for forced audio alignment using HTK and SoX ☆332 Updated 4 years ago
- g2p: English Grapheme To Phoneme Conversion ☆814 Updated last year
- FastCGI support for Kaldi ASR ☆184 Updated 5 years ago
- Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset. ☆843 Updated 3 years ago
- Voice Activity Detector in Python ☆472 Updated 4 years ago
- Large, modern dataset for speech recognition ☆649 Updated 9 months ago
- An End-to-End Architecture for Keyword Spotting and Voice Activity Detection ☆374 Updated last year
- A testing server for a speech to text service based on coqui.ai ☆215 Updated 2 years ago
- A python library for working with praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting … ☆314 Updated 11 months ago
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition ☆471 Updated 3 years ago
- The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech proc… ☆364 Updated 2 months ago
- Utterance-level Aggregation For Speaker Recognition In The Wild ☆365 Updated last year
- Tools for Speech Enhancement integrated with Kaldi ☆400 Updated last year
- Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate). ☆273 Updated last year
- Espresso: A Fast End-to-End Neural Speech Recognition Toolkit ☆942 Updated 2 months ago
- A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket conne… ☆216 Updated 4 years ago
- Working online speech recognition based on RNN Transducer. (Trained model release available in release) ☆290 Updated 3 years ago
- This is a GitHub repository of the abandonware Sequitur G2P by Bisani & Ney ☆156 Updated 4 months ago