alumae / kaldi-gstreamer-server
Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framework.
☆1,076 Updated 8 months ago
Alternatives and similar repositories for kaldi-gstreamer-server:
Users who are interested in kaldi-gstreamer-server are comparing it to the libraries listed below:
- Dockerfile for kaldi-gstreamer-server. ☆289 Updated 2 years ago
- GStreamer plugin around Kaldi's online neural network decoder ☆185 Updated 4 years ago
- A small JavaScript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket conne… ☆216 Updated 4 years ago
- A Python wrapper for Kaldi ☆1,006 Updated 3 weeks ago
- Open tools and data for cloudless automatic speech recognition ☆447 Updated 3 years ago
- Offline transcription system for Estonian using Kaldi ☆228 Updated 2 years ago
- The official repository of the Eesen project ☆826 Updated 5 years ago
- FastCGI support for Kaldi ASR ☆185 Updated 5 years ago
- G2P with TensorFlow ☆671 Updated 6 months ago
- This is a list of features, scripts, blogs and resources for better using Kaldi (http://kaldi-asr.org/) ☆535 Updated 3 years ago
- A Speaker Recognition System ☆676 Updated 4 years ago
- Phonetisaurus G2P ☆462 Updated 8 months ago
- Some simple wrappers around kaldi-asr intended to make using Kaldi's (online) decoders as convenient as possible. ☆170 Updated 3 years ago
- Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time ☆339 Updated last year
- Python interface for forced audio alignment using HTK and SoX ☆334 Updated 4 years ago
- A collection of links and notes on forced alignment tools ☆890 Updated 3 years ago
- Python interface to the WebRTC Voice Activity Detector ☆2,147 Updated 7 months ago
- This is now the official location of the Merlin project. ☆1,311 Updated 4 years ago
- A high-level toolkit for speaker recognition, built on top of ALIZE-Core. ☆126 Updated 6 years ago
- Espresso: A Fast End-to-End Neural Speech Recognition Toolkit ☆943 Updated 5 months ago
- Real-time Voice Activity Detection in Noisy Environments using Deep Neural Networks ☆434 Updated 4 years ago
- A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model ☆1,828 Updated 3 years ago
- Command line utility for forced alignment using Kaldi ☆1,402 Updated 2 months ago
- Examples of how to use or integrate DeepSpeech ☆836 Updated last year
- A testing server for a speech to text service based on coqui.ai ☆215 Updated 2 years ago
- An audio/acoustic activity detection and audio segmentation tool ☆765 Updated 2 months ago
- Voice Activity Detector in Python ☆472 Updated 4 years ago
- g2p: English Grapheme To Phoneme Conversion ☆836 Updated 2 years ago
- On-device streaming speech-to-text engine powered by deep learning ☆609 Updated last week
- Connectionist Temporal Classification (CTC) Automatic Speech Recognition ☆297 Updated 6 years ago