mozilla / DeepSpeech-examples
Examples of how to use or integrate DeepSpeech
☆854 · Updated 2 years ago
Alternatives and similar repositories for DeepSpeech-examples
Users who are interested in DeepSpeech-examples are comparing it to the libraries listed below.
- WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries ☆1,134 · Updated 2 months ago
- 🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy. ☆2,483 · Updated last year
- On-device streaming speech-to-text engine powered by deep learning ☆634 · Updated last week
- VOSK Speech Recognition Toolkit ☆458 · Updated 3 years ago
- Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice … ☆513 · Updated 2 years ago
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning. ☆838 · Updated last year
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in TensorFlow 2. Supported languages that can use characters or subw… ☆987 · Updated last month
- Open tools and data for cloudless automatic speech recognition ☆446 · Updated 4 years ago
- A list of accessible speech corpora for ASR, TTS, and other Speech Technologies ☆1,349 · Updated last year
- A testing server for a speech to text service based on coqui.ai ☆216 · Updated 3 years ago
- Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito. ☆584 · Updated 4 years ago
- Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framework. ☆1,086 · Updated last year
- An open-source text-to-speech (TTS) voice building tool ☆677 · Updated last year
- 🐸STT integration examples ☆129 · Updated 2 years ago
- Python interface to the WebRTC Voice Activity Detector ☆2,305 · Updated last year
- A lightweight, simple-to-use, RNN wake word listener ☆916 · Updated last year
- Docker image for Mozilla TTS server ☆202 · Updated last year
- Open Text to Speech Server ☆1,083 · Updated last year
- Live speech recognition using Facebook's wav2vec 2.0 model. ☆362 · Updated last year
- Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time ☆342 · Updated 2 years ago
- 🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech. ☆1,150 · Updated last year
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice. ☆210 · Updated last year
- A speech recognition library running in the browser thanks to a WebAssembly build of Vosk ☆466 · Updated last year
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ ) ☆537 · Updated 3 years ago
- gentle forced aligner ☆1,609 · Updated 2 months ago
- A Python wrapper for Kaldi ☆1,021 · Updated 6 months ago
- An audio/acoustic activity detection and audio segmentation tool ☆792 · Updated 7 months ago
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources. ☆1,777 · Updated last week
- A Python package to analyze and compare voices with deep learning ☆3,045 · Updated last year
- Simple text to phones converter for multiple languages ☆1,417 · Updated 10 months ago