mozilla / DeepSpeech-examples
Examples of how to use or integrate DeepSpeech
β843Updated last year
Alternatives and similar repositories for DeepSpeech-examples:
Users that are interested in DeepSpeech-examples are comparing it to the libraries listed below
- WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi librariesβ1,007Updated 6 months ago
- πΈSTT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.β2,382Updated last year
- Open tools and data for cloudless automatic speech recognitionβ447Updated 3 years ago
- VOSK Speech Recognition Toolkitβ406Updated 2 years ago
- A lightweight, simple-to-use, RNN wake word listenerβ887Updated last year
- Dockerfile for kaldi-gstreamer-server.β289Updated 2 years ago
- A testing server for a speech to text service based on coqui.aiβ215Updated 2 years ago
- Python interface to the WebRTC Voice Activity Detectorβ2,175Updated 8 months ago
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.β835Updated last year
- On-device streaming speech-to-text engine powered by deep learningβ619Updated this week
- π A list of accessible speech corpora for ASR, TTS, and other Speech Technologiesβ1,316Updated 9 months ago
- An opensource text-to-speech (TTS) voice building toolβ672Updated 8 months ago
- Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.β1,076Updated 9 months ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )β536Updated 3 years ago
- Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice β¦β506Updated last year
- Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-timeβ340Updated last year
- Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.β582Updated 3 years ago
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwβ¦β962Updated this week
- g2p: English Grapheme To Phoneme Conversionβ841Updated 2 years ago
- πΈSTT integration examplesβ126Updated 2 years ago
- A live speech recognition using Facebooks wav2vec 2.0 model.β344Updated last year
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.β203Updated 7 months ago
- A Python wrapper for Kaldiβ1,010Updated last month
- A python package to analyze and compare voices with deep learningβ2,879Updated last year
- Offline speech recognition for Android with Vosk library.β812Updated last year
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognitionβ480Updated 3 years ago
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyoneβ953Updated 4 months ago
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorchβ1,598Updated 11 months ago
- Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networksβ438Updated 4 years ago
- On-device speech-to-text engine powered by deep learningβ448Updated this week