MainRo / deepspeech-server
A testing server for a speech to text service based on coqui.ai
☆215Updated 2 years ago
Alternatives and similar repositories for deepspeech-server:
Users that are interested in deepspeech-server are comparing it to the libraries listed below
- Open tools and data for cloudless automatic speech recognition☆447Updated 4 years ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.☆170Updated 4 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments☆102Updated 4 years ago
- Mozilla deepspeech server implemented in django.☆49Updated 3 years ago
- A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket conne…☆217Updated 5 years ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )☆537Updated 3 years ago
- Dockerfile for kaldi-gstreamer-server.☆290Updated 3 years ago
- GStreamer plugin around Kaldi's online neural network decoder☆185Updated 4 years ago
- Scripts for training general-purpose large vocabulary German acoustic models for ASR with Kaldi.☆173Updated last year
- DeepSpeech based forced alignment tool☆237Updated 4 years ago
- Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.☆1,079Updated 10 months ago
- Scripts for training Mozilla's DeepSpeech using german speech data☆41Updated 5 years ago
- FastCGI support for Kaldi ASR☆184Updated 6 years ago
- wake word engine benchmark framework☆134Updated 3 years ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆471Updated 5 years ago
- Offline transcription system for Estonian using Kaldi☆227Updated 2 years ago
- A webpage and API for using Mozilla DeepSpeech☆47Updated 4 years ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.☆205Updated 9 months ago
- A list of publically available audio data that anyone can download for ASR or other speech activities☆207Updated 3 years ago
- G2P with Tensorflow☆672Updated 8 months ago
- Phonetisaurus G2P☆471Updated 10 months ago
- Voice Activity Detector in Python☆475Updated 4 years ago
- Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.☆583Updated 3 years ago
- A neural network intent parser☆162Updated 3 years ago
- A high-level toolkit for speaker recognition, build on top of ALIZE-Core.☆126Updated 6 years ago
- Speaker diarization scripts, based on AaltoASR☆190Updated 6 years ago
- SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition…☆98Updated 3 years ago
- Tool for creation, manipulation and maintenance of voice corpora☆81Updated 11 months ago
- voice services stack from audio hardware through hotword, ASR, NLU, AI routing and TTS bound by messaging protocol over MQTT☆94Updated last year
- 🐸STT integration examples☆127Updated 2 years ago