gooofy / zamia-speech
Open tools and data for cloudless automatic speech recognition
☆443Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for zamia-speech
- Scripts for training general-purpose large vocabulary German acoustic models for ASR with Kaldi.☆174Updated last year
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.☆170Updated 3 years ago
- GStreamer plugin around Kaldi's online neural network decoder☆185Updated 4 years ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )☆532Updated 2 years ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆464Updated 4 years ago
- Dockerfile for kaldi-gstreamer-server.☆287Updated 2 years ago
- Phonetisaurus G2P☆449Updated 5 months ago
- DeepSpeech based forced alignment tool☆233Updated 3 years ago
- g2p: English Grapheme To Phoneme Conversion☆810Updated last year
- Speaker diarization scripts, based on AaltoASR☆190Updated 5 years ago
- Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networks☆428Updated 4 years ago
- A list of publically available audio data that anyone can download for ASR or other speech activities☆200Updated 3 years ago
- Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).☆269Updated last year
- This is a github repository of the abandonware Sequitur G2P by Bisani & Ney☆155Updated 4 months ago
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.☆376Updated last year
- A python library for working with praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting …☆313Updated 10 months ago
- The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech proc…☆364Updated last month
- Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.☆1,071Updated 5 months ago
- Python interface for forced audio alignment using HTK and SoX☆331Updated 4 years ago
- PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.☆577Updated 2 years ago
- An End-to-End Architecture for Keyword Spotting and Voice Activity Detection☆372Updated last year
- FastCGI support for Kaldi ASR☆184Updated 5 years ago
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition☆469Updated 3 years ago
- Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time☆338Updated last year
- A testing server for a speech to text service based on coqui.ai☆214Updated 2 years ago
- Voice Activity Detector in Python☆472Updated 3 years ago
- wake word engine benchmark framework☆131Updated 2 years ago
- End-2-end speech synthesis with recurrent neural networks☆225Updated 8 months ago
- A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket conne…☆216Updated 4 years ago
- Voice Activity Detection based on Deep Learning & TensorFlow☆354Updated last year