mozilla / DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
β25,463Updated 3 months ago
Alternatives and similar repositories for DeepSpeech:
Users that are interested in DeepSpeech are comparing it to the libraries listed below
- Facebook AI Research's Automatic Speech Recognition Toolkitβ6,397Updated 2 weeks ago
- kaldi-asr/kaldi is the official location of the Kaldi project.β14,345Updated last week
- πSpeech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networksβ2,165Updated 10 months ago
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)β9,457Updated last year
- End-to-End Speech Processing Toolkitβ8,575Updated this week
- Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflowβ3,960Updated 3 years ago
- Tesseract Open Source OCR Engine (main repository)β63,028Updated this week
- Speech recognition module for Python, supporting several engines and APIs, online and offline.β8,469Updated this week
- π¬ Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, β¦β19,037Updated this week
- Face recognition with deep neural networks.β15,161Updated 2 months ago
- End-to-end Automatic Speech Recognition for Madarian and English in Tensorflowβ2,843Updated last year
- Library for fast text representation and classification.β25,970Updated 8 months ago
- Build and run Docker containers leveraging NVIDIA GPUsβ17,271Updated last year
- A small speech recognizerβ3,971Updated 2 weeks ago
- Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkitβ17,532Updated last year
- Examples of how to use or integrate DeepSpeechβ822Updated last year
- Visualizer for neural network, deep learning and machine learning modelsβ28,620Updated this week
- Snips Python library to extract meaning from textβ3,898Updated last year
- A framework for training and evaluating AI models on a variety of openly available dialogue datasets.β10,499Updated last year
- Deep Learning for humansβ62,187Updated this week
- Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthβ¦β2,997Updated last year
- Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and β¦β24,801Updated 2 months ago
- Port of OpenAI's Whisper model in C/C++β36,103Updated this week
- Clone a voice in 5 seconds to generate arbitrary speech in real-timeβ52,931Updated 3 months ago
- Multi-layer Recurrent Neural Networks (LSTM, GRU, RNN) for character-level language models in Torchβ11,665Updated last year
- On-device wake word detection powered by deep learningβ3,810Updated 2 weeks ago
- A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)β2,959Updated last year
- A toolkit for making real world machine learning and data analysis applications in C++β13,611Updated 3 weeks ago
- Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Nodeβ8,243Updated 3 weeks ago
- Code for the paper "Language Models are Unsupervised Multitask Learners"β22,625Updated 3 months ago