mozilla / DeepSpeechLinks
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
☆26,594Updated 2 months ago
Alternatives and similar repositories for DeepSpeech
Users that are interested in DeepSpeech are comparing it to the libraries listed below
Sorting:
- Facebook AI Research's Automatic Speech Recognition Toolkit☆6,441Updated 9 months ago
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)☆10,003Updated last year
- Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow☆3,988Updated 3 years ago
- kaldi-asr/kaldi is the official location of the Kaldi project.☆15,100Updated last month
- A small speech recognizer☆4,188Updated this week
- Common Voice is part of Mozilla's initiative to help teach machines how real people speak.☆3,415Updated this week
- Speech recognition module for Python, supporting several engines and APIs, online and offline.☆8,858Updated 3 months ago
- 🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks☆2,173Updated last year
- On-device wake word detection powered by deep learning☆4,368Updated this week
- Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node☆13,172Updated last week
- 🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.☆2,514Updated last year
- A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)☆2,983Updated 2 years ago
- Python library and CLI tool to interface with Google Translate's text-to-speech API☆2,520Updated last week
- Examples of how to use or integrate DeepSpeech☆857Updated 2 years ago
- End-to-End Speech Processing Toolkit☆9,449Updated this week
- Future versions with model training module will be maintained through a forked version here: https://github.com/seasalt-ai/snowboy☆3,287Updated 3 years ago
- Deep neural networks for voice conversion (voice style transfer) in Tensorflow☆3,939Updated 2 years ago
- Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Juli…☆20,826Updated last year
- End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow☆2,843Updated 2 years ago
- Mycroft Core, the Mycroft Artificial Intelligence platform.☆6,605Updated last year
- MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java☆2,532Updated 7 months ago
- eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.☆5,550Updated this week
- An open source library for deep learning end-to-end dialog systems and chatbots.☆6,928Updated last month
- Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text fronten…☆12,225Updated this week
- Port of OpenAI's Whisper model in C/C++☆43,071Updated last week
- A TensorFlow implementation of DeepMind's WaveNet paper☆5,435Updated 2 years ago
- A multi-voice TTS system trained with an emphasis on quality☆14,572Updated 9 months ago
- PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models☆1,980Updated last year
- A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model☆1,832Updated 3 years ago
- Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synth…☆3,068Updated last year