daanzu / deepspeech-websocket-server
Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments
☆102Updated 4 years ago
Alternatives and similar repositories for deepspeech-websocket-server:
Users that are interested in deepspeech-websocket-server are comparing it to the libraries listed below
- A testing server for a speech to text service based on coqui.ai☆215Updated 2 years ago
- DeepSpeech based forced alignment tool☆235Updated 4 years ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.☆170Updated 3 years ago
- 🐸STT integration examples☆123Updated 2 years ago
- Program to benchmark various speech recognition APIs☆80Updated 5 years ago
- Speech-to-text based on wav2letter built for transfer learning☆97Updated 2 years ago
- 🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)☆223Updated 4 years ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.☆203Updated 6 months ago
- On-device voice activity detection (VAD) powered by deep learning☆192Updated 2 weeks ago
- GStreamer plugin around Kaldi's online neural network decoder☆185Updated 4 years ago
- Tool for creation, manipulation and maintenance of voice corpora☆81Updated 8 months ago
- Mozilla deepspeech server implemented in django.☆49Updated 3 years ago
- Open tools and data for cloudless automatic speech recognition☆447Updated 3 years ago
- Web app for keyword spotting using TensorflowJS☆69Updated 2 years ago
- Python server for communicating with Kaldi from the browser using WebRTC☆69Updated last year
- [deprecated] Pretrained models for pyannote-audio 1.x☆72Updated 2 years ago
- Command line tool to create corpora for Common Voice☆75Updated 8 months ago
- Speaker diarization python system based on binary key speaker modelling☆61Updated 3 years ago
- Scripts to simplify data prepping for Mozilla DeepSpeech.☆14Updated 5 years ago
- Server framework for Kaldi ASR Toolkit☆98Updated last year
- End-2-end speech synthesis with recurrent neural networks☆225Updated 11 months ago
- A list of publically available audio data that anyone can download for ASR or other speech activities☆202Updated 3 years ago
- SEPIA server to support open-source speech recognition via WebSocket connection.☆123Updated 2 months ago
- Python library for handling audio datasets.☆136Updated last year
- voice services stack from audio hardware through hotword, ASR, NLU, AI routing and TTS bound by messaging protocol over MQTT☆93Updated last year
- Speaker diarization scripts, based on AaltoASR☆190Updated 6 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆94Updated 2 weeks ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks☆63Updated 4 years ago
- Evaluate results from ASR/Speech-to-Text quickly☆36Updated 3 years ago
- A Collection of Speech Corpus for ASR and TTS☆113Updated 7 years ago