mozilla / DeepSpeechLinks
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
☆26,588Updated 2 months ago
Alternatives and similar repositories for DeepSpeech
Users that are interested in DeepSpeech are comparing it to the libraries listed below
Sorting:
- Facebook AI Research's Automatic Speech Recognition Toolkit☆6,440Updated 9 months ago
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)☆9,986Updated last year
- kaldi-asr/kaldi is the official location of the Kaldi project.☆15,090Updated last month
- Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow☆3,987Updated 3 years ago
- 🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.☆2,507Updated last year
- A small speech recognizer☆4,178Updated last week
- Speech recognition module for Python, supporting several engines and APIs, online and offline.☆8,855Updated 3 months ago
- Common Voice is part of Mozilla's initiative to help teach machines how real people speak.☆3,413Updated this week
- End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow☆2,843Updated 2 years ago
- 🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks☆2,173Updated last year
- A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)☆2,982Updated 2 years ago
- Examples of how to use or integrate DeepSpeech☆858Updated 2 years ago
- A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Auto…☆15,558Updated this week
- Open-Source Large Vocabulary Continuous Speech Recognition Engine☆1,900Updated 2 months ago
- Video editing with Python☆13,870Updated last week
- Deep neural networks for voice conversion (voice style transfer) in Tensorflow☆3,938Updated 2 years ago
- On-device wake word detection powered by deep learning☆4,344Updated 2 weeks ago
- Speech Recognition using DeepSpeech2.☆2,131Updated 2 years ago
- 💫 Industrial-strength Natural Language Processing (NLP) in Python☆32,404Updated 3 months ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆87,677Updated 2 weeks ago
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more☆33,316Updated this week
- Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synth…☆3,064Updated last year
- Port of OpenAI's Whisper model in C/C++☆42,907Updated last week
- Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node☆13,138Updated this week
- A TensorFlow implementation of DeepMind's WaveNet paper☆5,434Updated 2 years ago
- Cross-platform, customizable ML solutions for live and streaming media.☆31,168Updated this week
- Magenta: Music and Art Generation with Machine Intelligence☆19,629Updated last month
- Clone a voice in 5 seconds to generate arbitrary speech in real-time☆54,934Updated 3 months ago
- GPT-3: Language Models are Few-Shot Learners☆15,781Updated 4 years ago
- Python interface to the WebRTC Voice Activity Detector☆2,339Updated last year