mozilla / DeepSpeechLinks
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
☆26,556Updated last month
Alternatives and similar repositories for DeepSpeech
Users that are interested in DeepSpeech are comparing it to the libraries listed below
Sorting:
- Facebook AI Research's Automatic Speech Recognition Toolkit☆6,436Updated 8 months ago
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)☆9,949Updated last year
- kaldi-asr/kaldi is the official location of the Kaldi project.☆15,029Updated 3 weeks ago
- A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)☆2,981Updated 2 years ago
- Deep neural networks for voice conversion (voice style transfer) in Tensorflow☆3,938Updated 2 years ago
- 🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.☆2,493Updated last year
- Speech Recognition using DeepSpeech2.☆2,128Updated 2 years ago
- On-device wake word detection powered by deep learning☆4,308Updated last week
- Examples of how to use or integrate DeepSpeech☆855Updated 2 years ago
- Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node☆12,918Updated 3 weeks ago
- Common Voice is part of Mozilla's initiative to help teach machines how real people speak.☆3,397Updated this week
- A TensorFlow implementation of DeepMind's WaveNet paper☆5,434Updated 2 years ago
- End-to-End Speech Processing Toolkit☆9,365Updated last week
- Speech recognition module for Python, supporting several engines and APIs, online and offline.☆8,821Updated 2 months ago
- A small speech recognizer☆4,166Updated this week
- A python package to analyze and compare voices with deep learning☆3,055Updated last year
- DeepMind's Tacotron-2 Tensorflow implementation☆2,309Updated 2 years ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference☆5,260Updated last year
- An open source library for deep learning end-to-end dialog systems and chatbots.☆6,922Updated last week
- WaveRNN Vocoder + TTS☆2,165Updated 3 years ago
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, Germa…☆3,959Updated last year
- WaveNet vocoder☆2,359Updated 2 years ago
- Pytorch implementation of our method for high-resolution (e.g. 2048x1024) photorealistic video-to-video translation.☆8,688Updated 3 years ago
- MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java☆2,524Updated 6 months ago
- Open-Source Large Vocabulary Continuous Speech Recognition Engine☆1,899Updated last month
- Clone a voice in 5 seconds to generate arbitrary speech in real-time☆54,845Updated 2 months ago
- Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications☆6,109Updated last week
- A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model☆1,832Updated 3 years ago
- Production First and Production Ready End-to-End Speech Recognition Toolkit☆4,743Updated last month
- A PyTorch-based Speech Toolkit☆10,272Updated this week