Scripts to simplify data prepping for Mozilla DeepSpeech.
☆14 · Aug 6, 2019 · Updated 6 years ago
Alternatives and similar repositories for deepspeech-tools
Users that are interested in deepspeech-tools are comparing it to the libraries listed below.
- Script to train a German n-gram Language Model on articles of Wikipedia ☆14 · Oct 20, 2018 · Updated 7 years ago
- Crawling and creating a German language model resource ☆18 · Aug 23, 2022 · Updated 3 years ago
- Python package for the Zero Speech Challenge 2020 ☆14 · Feb 5, 2021 · Updated 5 years ago
- A very basic demonstration connecting speech recognition and text-to-speech ☆20 · May 3, 2020 · Updated 6 years ago
- A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version) ☆17 · Apr 2, 2018 · Updated 8 years ago
- IPA Phonetic dataset lexicon ☆18 · May 26, 2026 · Updated 2 weeks ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory ☆16 · Mar 18, 2019 · Updated 7 years ago
- A webpage and API for using Mozilla DeepSpeech ☆48 · Feb 24, 2021 · Updated 5 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments ☆103 · May 29, 2020 · Updated 6 years ago
- AVPipe :-) ☆12 · Jul 16, 2021 · Updated 4 years ago
- Tool for creation, manipulation and maintenance of voice corpora ☆82 · May 3, 2024 · Updated 2 years ago
- DeepSpeech based forced alignment tool ☆239 · Dec 12, 2020 · Updated 5 years ago
- Wrapper for the yr.no weather service API. ☆15 · Apr 12, 2018 · Updated 8 years ago
- The code enables users to use Mozilla's Deep Speech model over the Web Browser. ☆32 · Jan 4, 2023 · Updated 3 years ago
- Resources for "Simple Speech Representation Learning from Perceptual Data". ☆11 · Sep 18, 2023 · Updated 2 years ago
- A small package for handy conversion of German numerals (also ordinal / signed) written as words to numbers. ☆12 · Jan 22, 2026 · Updated 4 months ago
- Voice services stack from audio hardware through hotword, ASR, NLU, AI routing and TTS bound by messaging protocol over MQTT ☆93 · Jul 21, 2023 · Updated 2 years ago
- ☆12 · Nov 6, 2015 · Updated 10 years ago
- Edge/mobile transformer-based Vision DNN inference benchmark ☆16 · Aug 29, 2025 · Updated 9 months ago
- Google's TPGST reimplementation. ☆34 · Dec 11, 2019 · Updated 6 years ago
- Self improving agents through iterations ☆92 · Updated this week
- A library of speech gadgets. ☆14 · Oct 15, 2022 · Updated 3 years ago
- Keyword Spotting using BCResNet and Arcface Loss ☆13 · Jan 28, 2022 · Updated 4 years ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice. ☆214 · Jul 25, 2024 · Updated last year
- Highly encrypted open source Chat and VOIP app with Firebase, supporting conference and p2p calls with jitsi ☆11 · Feb 2, 2023 · Updated 3 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models. ☆13 · Feb 13, 2021 · Updated 5 years ago
- Using YouTube to prepare a speech recognition dataset for any language ☆10 · Mar 30, 2021 · Updated 5 years ago
- A very tiny Python API for the stock exchange tradegate.de ☆16 · Jan 20, 2022 · Updated 4 years ago
- Open Source AI Benchmarking toolkit for benchmarking speech to text services ☆59 · Apr 17, 2024 · Updated 2 years ago
- PyTorch speech2text inference script for the NVIDIA OpenSeq2Seq wav2letter model variant ☆10 · Aug 12, 2019 · Updated 6 years ago
- TensorFlow training scripts for depthwise separable convolutional neural networks for keyword spotting, and C++ code for deployment. ☆41 · Apr 2, 2020 · Updated 6 years ago
- Adnabod lleferydd Cymraeg i'r Gymraeg gyda HuggingFace // Speech Recognition for Welsh with HuggingFace ☆13 · Nov 29, 2022 · Updated 3 years ago
- ☆14 · Jan 8, 2024 · Updated 2 years ago
- Read audio with FFmpeg into NumPy/PyTorch via ctypes (standard library module) ☆11 · Aug 12, 2020 · Updated 5 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni Hindi ASR challenge ☆16 · Mar 26, 2022 · Updated 4 years ago
- A study of the downstream instability of word embeddings ☆12 · Aug 23, 2022 · Updated 3 years ago
- ☆15 · Nov 1, 2018 · Updated 7 years ago
- Acoustic echo cancelation (AEC) is a main algorithm in the pipeline of acoustic devices with KWS or ASR. FNLMS is used. ☆19 · Apr 22, 2019 · Updated 7 years ago
- AR.Drone 2.0 human tracking with Mobilenet-SSD and PID control ☆17 · Sep 19, 2019 · Updated 6 years ago