JPEWdev / deep-dregs
A streaming Speech to Text server using DeepSpeech
☆16Updated 4 years ago
Alternatives and similar repositories for deep-dregs:
Users who are interested in deep-dregs are comparing it to the libraries listed below
- SEPIA server to support open-source speech recognition via WebSocket connection.☆123Updated 3 months ago
- This repository is for wake-word detection in speech using recurrent neural networks☆17Updated 6 years ago
- SoundNet, built in Keras with pre-trained 8-layer model.☆29Updated 5 years ago
- Awesome stuff made by the Mycroft community☆14Updated 3 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stack☆26Updated 2 years ago
- Evaluation of STT models for the German language☆15Updated 3 years ago
- Mycroft's multilingual text parsing and formatting library☆76Updated last year
- ☆40Updated 6 years ago
- Silence detection in audio stream using webrtcvad☆46Updated last year
- STT Service based on Kaldi ASR☆15Updated 6 years ago
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone☆35Updated 3 years ago
- Won't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection?☆34Updated 6 years ago
- PyTorch implementation of DeepMind's WaveRNN model☆121Updated 5 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.☆65Updated 5 years ago
- Python wrapper for the rnnoise library☆46Updated 2 years ago
- ☆10Updated last week
- A collection of basic Python modules for spoken natural language processing☆56Updated 5 years ago
- Text to speech plugin for Mycroft using Mimic 3☆7Updated 2 years ago
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech (TTS) system described in "Natural TTS Synthesis By Conditioning Waven…☆52Updated 6 years ago
- A neural network intent parser☆161Updated 3 years ago
- Speaker diarization system using an LSTM☆49Updated 2 years ago
- A simple audio feature extraction library☆79Updated 5 years ago
- 🦁 Nala is an agile open-source voice assistant framework (20+ actions).☆35Updated last year
- A large, free audio sample database (10M words pronounced), a test bed for voice activity detection algorithms and for single-syllable wo…☆69Updated 7 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Updated 3 years ago
- Automated lyrics-to-audio alignment using syllabic nuclei detection. Developed during Google Summer of Code 2019.☆50Updated last year
- EARS: Environmental Audio Recognition System☆111Updated 6 years ago
- Server framework for Kaldi ASR Toolkit☆98Updated last year
- A collection of speech corpora for ASR and TTS☆113Updated 7 years ago
- Api.ai English Speech Recognition (ASR) Model for Kaldi☆36Updated 4 years ago