mozilla / DeepSpeech-examples
Examples of how to use or integrate DeepSpeech
β821Updated last year
Related projects β
Alternatives and complementary repositories for DeepSpeech-examples
- π A list of accessible speech corpora for ASR, TTS, and other Speech Technologiesβ1,283Updated 5 months ago
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwβ¦β941Updated 2 months ago
- VOSK Speech Recognition Toolkitβ383Updated 2 years ago
- Open tools and data for cloudless automatic speech recognitionβ443Updated 3 years ago
- A testing server for a speech to text service based on coqui.aiβ215Updated 2 years ago
- Python interface to the WebRTC Voice Activity Detectorβ2,068Updated 4 months ago
- On-device streaming speech-to-text engine powered by deep learningβ594Updated 2 weeks ago
- WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi librariesβ929Updated 2 months ago
- Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.β580Updated 3 years ago
- πΈSTT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.β2,284Updated 8 months ago
- A lightweight, simple-to-use, RNN wake word listenerβ852Updated 11 months ago
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.β830Updated last year
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )β532Updated 2 years ago
- The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech procβ¦β364Updated last month
- Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice β¦β500Updated last year
- Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.β1,073Updated 5 months ago
- Efficient neural speech synthesisβ1,142Updated 2 months ago
- g2p: English Grapheme To Phoneme Conversionβ811Updated last year
- Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-timeβ339Updated last year
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognitionβ469Updated 3 years ago
- The Implementation of FastSpeech based on pytorch.β858Updated last year
- A Python wrapper for Kaldiβ999Updated 3 months ago
- Dockerfile for kaldi-gstreamer-server.β288Updated 2 years ago
- Voice Activity Detection based on Deep Learning & TensorFlowβ355Updated last year
- Large, modern dataset for speech recognitionβ646Updated 8 months ago
- Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.β585Updated last year
- πΈSTT integration examplesβ121Updated 2 years ago
- A Generative Flow for Text-to-Speech via Monotonic Alignment Searchβ667Updated 2 years ago
- an open-source implementation of sequence-to-sequence based speech processing engineβ952Updated last year
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.β170Updated 3 years ago