alumae / kaldi-gstreamer-server
Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framework.
☆1,081 Updated 11 months ago
Alternatives and similar repositories for kaldi-gstreamer-server:
Users who are interested in kaldi-gstreamer-server are comparing it to the libraries listed below:
- Dockerfile for kaldi-gstreamer-server. ☆290 Updated 3 years ago
- GStreamer plugin around Kaldi's online neural network decoder ☆186 Updated 4 years ago
- Open tools and data for cloudless automatic speech recognition ☆447 Updated 4 years ago
- A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket conne… ☆217 Updated 5 years ago
- G2P with Tensorflow ☆673 Updated 9 months ago
- ☆526 Updated 2 years ago
- The official repository of the Eesen project ☆829 Updated 5 years ago
- A Python wrapper for Kaldi ☆1,015 Updated 3 months ago
- This is a list of features, scripts, blogs and resources for better using Kaldi (http://kaldi-asr.org/) ☆537 Updated 3 years ago
- Phonetisaurus G2P ☆473 Updated 11 months ago
- Offline transcription system for Estonian using Kaldi ☆227 Updated 2 years ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible. ☆170 Updated 4 years ago
- This is now the official location of the Merlin project. ☆1,313 Updated 5 years ago
- Python module installed with setup.py ☆337 Updated 2 years ago
- Speaker diarization scripts, based on AaltoASR ☆190 Updated 6 years ago
- Attempt at tracking states of the arts and recent results (bibliography) on speech recognition. ☆1,863 Updated 2 years ago
- A testing server for a speech to text service based on coqui.ai ☆215 Updated 2 years ago
- FastCGI support for Kaldi ASR ☆184 Updated 6 years ago
- Python interface to the WebRTC Voice Activity Detector ☆2,225 Updated 10 months ago
- Espresso: A Fast End-to-End Neural Speech Recognition Toolkit ☆943 Updated 8 months ago
- Acoustic model trainer for CMU Sphinx ☆185 Updated 4 months ago
- A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model ☆1,832 Updated 3 years ago
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies ☆1,324 Updated 11 months ago
- A collection of links and notes on forced alignment tools ☆906 Updated 3 years ago
- g2p: English Grapheme To Phoneme Conversion ☆849 Updated 2 years ago
- A Speaker Recognition System ☆675 Updated 5 years ago
- Voice Activity Detector in Python ☆475 Updated 4 years ago
- Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset. ☆854 Updated 3 years ago
- 🎙 Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks ☆2,170 Updated last year
- Scripts for training general-purpose large vocabulary German acoustic models for ASR with Kaldi. ☆173 Updated last year