MainRo / deepspeech-server
A testing server for a speech to text service based on coqui.ai
☆215Updated 2 years ago
Alternatives and similar repositories for deepspeech-server:
Users that are interested in deepspeech-server are comparing it to the libraries listed below
- Open tools and data for cloudless automatic speech recognition☆447Updated 4 years ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.☆170Updated 4 years ago
- Phonetisaurus G2P☆473Updated 11 months ago
- Dockerfile for kaldi-gstreamer-server.☆290Updated 3 years ago
- GStreamer plugin around Kaldi's online neural network decoder☆186Updated 4 years ago
- Mozilla deepspeech server implemented in django.☆49Updated 3 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments☆102Updated 4 years ago
- DeepSpeech based forced alignment tool☆237Updated 4 years ago
- Examples of how to use or integrate DeepSpeech☆845Updated last year
- FastCGI support for Kaldi ASR☆184Updated 6 years ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆470Updated 5 years ago
- Scripts for training general-purpose large vocabulary German acoustic models for ASR with Kaldi.☆173Updated last year
- A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket conne…☆217Updated 5 years ago
- Speaker diarization scripts, based on AaltoASR☆190Updated 6 years ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )☆537Updated 3 years ago
- Voice Activity Detection based on Deep Learning & TensorFlow☆363Updated 2 years ago
- Voice Activity Detector in Python☆475Updated 4 years ago
- Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.☆1,081Updated 10 months ago
- An End-to-End Architecture for Keyword Spotting and Voice Activity Detection☆379Updated 2 years ago
- g2p: English Grapheme To Phoneme Conversion☆849Updated 2 years ago
- G2P with Tensorflow☆673Updated 9 months ago
- Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.☆583Updated 3 years ago
- 🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)☆224Updated 4 years ago
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.☆376Updated last year
- A list of publically available audio data that anyone can download for ASR or other speech activities☆209Updated 3 years ago
- A webpage and API for using Mozilla DeepSpeech☆47Updated 4 years ago
- wake word engine benchmark framework☆134Updated 3 years ago
- Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networks☆439Updated 4 years ago
- Tool for creation, manipulation and maintenance of voice corpora☆81Updated last year
- 🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).☆380Updated 2 years ago