mozilla/deepspeech-playbook

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mozilla/deepspeech-playbook)

mozilla / deepspeech-playbook

DEPRECATED - A crash course for training speech recognition models using DeepSpeech.

☆24

Alternatives and similar repositories for deepspeech-playbook

Users that are interested in deepspeech-playbook are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

CypherousSkies / reading-for-listeners
View on GitHub
A deep-learning powered accessibility application which turns pdfs into audio files. Featuring ocr improvement and tts with inflection!
☆25Feb 17, 2025Updated last year
coqui-ai / data-checker
View on GitHub
🫠 check your data, before you wreck your model
☆16Aug 11, 2022Updated 3 years ago
Digital-Umuganda / Deepspeech-Kinyarwanda
View on GitHub
The kinyarwanda model for deepspeech
☆17May 11, 2021Updated 5 years ago
domcross / german-stt-evaluation
View on GitHub
Evaluation of STT models for german language
☆16Jan 22, 2022Updated 4 years ago
mpourmpoulis / PythonVoiceCodingPlugin
View on GitHub
Sublime Text 3 plugin for voice coding Python 3
☆13Sep 15, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
iisys-hof / HUI-Audio-Corpus-German
View on GitHub
This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…
☆35Mar 31, 2023Updated 3 years ago
archiki / ASR-Accent-Analysis
View on GitHub
Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.
☆15Jun 27, 2020Updated 6 years ago
rhasspy / glow-speak
View on GitHub
Neural text to speech system that uses eSpeak as a text/phoneme front-end
☆16Oct 20, 2021Updated 4 years ago
coqui-ai / TTS-recipes
View on GitHub
🐸TTS recipes for different datasets
☆88Jul 26, 2022Updated 3 years ago
ChanceNCounter / awesome-mycroft-community
View on GitHub
Awesome stuff made by the Mycroft community
☆12Sep 16, 2021Updated 4 years ago
lucky-bai / kaggle-speech-recognition
View on GitHub
TensorFlow Speech Recognition Challenge (Top 15%)
☆14Jan 16, 2018Updated 8 years ago
coqui-ai / snakepit
View on GitHub
🐍 Coqui's machine learning job scheduler
☆31Sep 5, 2021Updated 4 years ago
elpimous / yellow_robot
View on GitHub
a quadruped robot, inspired from OpenQuadripeder/microspotAi robot
☆17Oct 21, 2020Updated 5 years ago
shleeable / bonusmastodondocs
View on GitHub
☆12Dec 11, 2022Updated 3 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
olastor / german-word-frequencies
View on GitHub
Simple word to frequency mappings for the german language based on text corpora and using CISTEM stemmer.
☆14Apr 3, 2021Updated 5 years ago
nils-werner / pymushra
View on GitHub
pyMUSHRA is a python web application which hosts webMUSHRA experiments and collects the data with python.
☆47Jul 3, 2026Updated 3 weeks ago
chitralekha18 / AutomaticSungLyricsAnnotation_ISMIR2018
View on GitHub
☆22Sep 26, 2022Updated 3 years ago
smaybius / Coqui-TTS-GUI-solution
View on GitHub
Interface for using TTS and vocoder models in the form of a text editor
☆20Nov 25, 2025Updated 8 months ago
desh2608 / kaldi-noise-vectors
View on GitHub
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13Feb 13, 2021Updated 5 years ago
SpeechColab / PySpeechColab
View on GitHub
A library of speech gadgets.
☆15Oct 15, 2022Updated 3 years ago
AI-Lab-Makerere / Data4Good
View on GitHub
This repository contains publicly available speech and text data in Luganda.
☆12Sep 4, 2020Updated 5 years ago
vadimkantorov / inferspeech
View on GitHub
PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant
☆10Aug 12, 2019Updated 6 years ago
mozilla / DeepSpeech-examples
View on GitHub
Examples of how to use or integrate DeepSpeech
☆854Jul 25, 2023Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
maelstromdat / YOSHI
View on GitHub
a tool to study online software development communities
☆12Apr 13, 2026Updated 3 months ago
emirdemirel / ALTA
View on GitHub
A complete training recipe for kaldi-based Automatic Lyrics Transcription.
☆32Nov 30, 2021Updated 4 years ago
daanzu / deepspeech-websocket-server
View on GitHub
Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments
☆103May 29, 2020Updated 6 years ago
common-voice / CorporaCreator
View on GitHub
Command line tool to create corpora for Common Voice
☆78Mar 25, 2026Updated 4 months ago
vadimkantorov / readaudio
View on GitHub
Read audio with FFmpeg into NumPy/PyTorch via ctypes (standard library module)
☆11Aug 12, 2020Updated 5 years ago
OpenVoiceOS / ovos-personal-backend
View on GitHub
personal backend - self-hosted backend to manage multiple OVOS devices
☆84Sep 16, 2024Updated last year
gpu-poor / gramvaani_hindi_asr
View on GitHub
This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge
☆16Mar 26, 2022Updated 4 years ago
usc-sail / barista
View on GitHub
Barista is an open-source framework for concurrent speech processing.
☆36Mar 19, 2014Updated 12 years ago
eric-haibin-lin / JSALT19-GluonNLP
View on GitHub
JSALT 2019 Montréal: Dive into Deep Learning for Natural Language Processing
☆16Jun 14, 2019Updated 7 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
JRMeyer / common-voice-forced-alignments
View on GitHub
Forced Alignments for Common Voice
☆33Oct 30, 2020Updated 5 years ago
nickbild / go_motion
View on GitHub
Simplify stop motion animation with machine learning.
☆28Sep 17, 2021Updated 4 years ago
thorstenMueller / cTTS
View on GitHub
TTS Client for Coqui TTS server
☆13Jan 7, 2023Updated 3 years ago
arirawr / workshop-templates
View on GitHub
Workshop Template Pack for workshop facilitators
☆15Nov 29, 2017Updated 8 years ago
AusDTO / service-handbook
View on GitHub
The DTO's guide to building digital services
☆14Apr 19, 2018Updated 8 years ago
JRMeyer / markdown-datasheet-for-datasets
View on GitHub
Markdown template for Dataseets for Datasets
☆64Apr 30, 2022Updated 4 years ago
kaltura / all-in-one-video-pack.wordpress
View on GitHub
A Wordpress Plugin to simplify adding Kaltura to your Blog
☆19Jul 9, 2026Updated 2 weeks ago