mozilla / DeepSpeech-examplesLinks
Examples of how to use or integrate DeepSpeech
β857Updated 2 years ago
Alternatives and similar repositories for DeepSpeech-examples
Users that are interested in DeepSpeech-examples are comparing it to the libraries listed below
Sorting:
- πΈSTT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.β2,514Updated last year
- On-device streaming speech-to-text engine powered by deep learningβ637Updated last week
- A lightweight, simple-to-use, RNN wake word listenerβ926Updated last year
- WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi librariesβ1,160Updated last month
- Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.β584Updated 4 years ago
- VOSK Speech Recognition Toolkitβ472Updated 3 years ago
- π A list of accessible speech corpora for ASR, TTS, and other Speech Technologiesβ1,354Updated last year
- Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice β¦β513Updated 2 years ago
- Open tools and data for cloudless automatic speech recognitionβ447Updated 4 years ago
- An opensource text-to-speech (TTS) voice building toolβ680Updated last year
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwβ¦β994Updated 2 months ago
- Python interface to the WebRTC Voice Activity Detectorβ2,344Updated last year
- On-device Speech-to-Intent engine powered by deep learningβ679Updated this week
- A testing server for a speech to text service based on coqui.aiβ216Updated 3 years ago
- Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.β1,088Updated last year
- Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-timeβ344Updated last week
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.β838Updated last year
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.β213Updated last year
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )β537Updated 3 years ago
- πΈSTT integration examplesβ129Updated 2 years ago
- On-device speech-to-text engine powered by deep learningβ458Updated this week
- Open-Source Large Vocabulary Continuous Speech Recognition Engineβ1,901Updated 2 months ago
- A Python wrapper for Kaldiβ1,027Updated 7 months ago
- CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender β¦β834Updated 8 months ago
- Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networksβ451Updated 5 years ago
- Open Text to Speech Serverβ1,099Updated last year
- Dockerfile for kaldi-gstreamer-server.β290Updated 3 years ago
- A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)β2,983Updated 2 years ago
- g2p: English Grapheme To Phoneme Conversionβ877Updated 2 years ago
- OneShot Learning-based hotword detection.β282Updated 11 months ago