alumae / kaldi-gstreamer-serverLinks
Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.
☆1,082Updated 11 months ago
Alternatives and similar repositories for kaldi-gstreamer-server
Users that are interested in kaldi-gstreamer-server are comparing it to the libraries listed below
Sorting:
- Dockerfile for kaldi-gstreamer-server.☆288Updated 3 years ago
- GStreamer plugin around Kaldi's online neural network decoder☆185Updated 4 years ago
- Open tools and data for cloudless automatic speech recognition☆446Updated 4 years ago
- A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket conne…☆216Updated 5 years ago
- A Speaker Recognition System☆676Updated 5 years ago
- G2P with Tensorflow☆674Updated 10 months ago
- A Python wrapper for Kaldi☆1,017Updated 4 months ago
- This is now the official location of the Merlin project.☆1,314Updated 5 years ago
- The official repository of the Eesen project☆829Updated 6 years ago
- Voice activity detection (VAD) library, based on WebRTC's VAD engine☆539Updated last year
- Python interface to the WebRTC Voice Activity Detector☆2,246Updated 10 months ago
- Offline transcription system for Estonian using Kaldi☆227Updated 2 years ago
- FastCGI support for Kaldi ASR☆183Updated 6 years ago
- 🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks☆2,171Updated last year
- Phonetisaurus G2P☆477Updated last year
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )☆535Updated 3 years ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.☆170Updated 4 years ago
- Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networks☆444Updated 4 years ago
- Voice Activity Detector in Python☆475Updated 4 years ago
- Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.☆858Updated 3 years ago
- An audio/acoustic activity detection and audio segmentation tool☆778Updated 5 months ago
- A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model☆1,832Updated 3 years ago
- Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time☆342Updated last year
- Deep Learning & 3D Convolutional Neural Networks for Speaker Verification☆785Updated 5 years ago
- ☆483Updated 7 years ago
- Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.☆1,864Updated 2 years ago
- WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries☆1,048Updated last week
- FSA/FST algorithms, differentiable, with PyTorch compatibility.☆1,201Updated last week
- Deep Speaker: an End-to-End Neural Speaker Embedding System.☆930Updated last year
- Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow☆3,985Updated 3 years ago