daanzu / deepspeech-websocket-server
Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments
☆101 Updated 4 years ago
Related projects
Alternatives and complementary repositories for deepspeech-websocket-server
- A testing server for a speech to text service based on coqui.ai ☆215 Updated 2 years ago
- Speaker diarization scripts, based on AaltoASR ☆190 Updated 5 years ago
- Tool for creation, manipulation and maintenance of voice corpora ☆81 Updated 6 months ago
- Some simple wrappers around kaldi-asr intended to make using Kaldi's (online) decoders as convenient as possible. ☆170 Updated 3 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x ☆71 Updated 2 years ago
- DeepSpeech based forced alignment tool ☆234 Updated 3 years ago
- Speech-to-text based on wav2letter built for transfer learning ☆96 Updated 2 years ago
- GStreamer plugin around Kaldi's online neural network decoder ☆185 Updated 4 years ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice. ☆200 Updated 3 months ago
- Web app for keyword spotting using TensorFlow.js ☆69 Updated last year
- Python library for handling audio datasets. ☆131 Updated last year
- Python server for communicating with Kaldi from the browser using WebRTC ☆67 Updated last year
- A list of publicly available audio data that anyone can download for ASR or other speech activities ☆200 Updated 3 years ago
- Server framework for Kaldi ASR Toolkit ☆97 Updated last year
- Program to benchmark various speech recognition APIs ☆79 Updated 5 years ago
- Mozilla DeepSpeech server implemented in Django. ☆49 Updated 3 years ago
- PyTorch implementation of DeepMind's WaveRNN model ☆121 Updated 5 years ago
- On-device voice activity detection (VAD) powered by deep learning ☆179 Updated this week
- JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voice ☆10 Updated 4 years ago
- Use your data to create a speech recognition system in Kaldi. Fast. ☆65 Updated 4 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆100 Updated last year
- 🐸TTS recipes for different datasets ☆84 Updated 2 years ago
- Open tools and data for cloudless automatic speech recognition ☆443 Updated 3 years ago
- Command line tool to create corpora for Common Voice ☆75 Updated 5 months ago
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone ☆37 Updated 2 years ago
- Python speaker diarization system based on binary key speaker modelling ☆61 Updated 2 years ago
- How to create your own model for Vosk ☆64 Updated 3 years ago
- End-to-end speech synthesis with recurrent neural networks ☆225 Updated 8 months ago
- Wheels for TensorFlow and DeepSpeech compiled for NVIDIA Jetson Nano (arm64) ☆89 Updated 3 years ago
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems ☆187 Updated last year