Scripts to simplify data prepping for Mozilla DeepSpeech.
☆14Aug 6, 2019Updated 6 years ago
Alternatives and similar repositories for deepspeech-tools
Users that are interested in deepspeech-tools are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Crawling and creating a German language model resource☆18Aug 23, 2022Updated 3 years ago
- Main database of generative AI systems☆20Updated this week
- Python package for the Zero Speech Challenge 2020☆14Feb 5, 2021Updated 5 years ago
- Mozilla deepspeech server implemented in django.☆49Jun 10, 2021Updated 4 years ago
- A very basic demonstration connecting speech recognition and text-to-speech☆20May 3, 2020Updated 5 years ago
- A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)☆17Apr 2, 2018Updated 7 years ago
- Fast word segmentation with a focus on splitting #hashtags☆14Sep 29, 2021Updated 4 years ago
- IPA Phonetic dataset lexicon☆18Updated this week
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory☆16Mar 18, 2019Updated 7 years ago
- A webpage and API for using Mozilla DeepSpeech☆48Feb 24, 2021Updated 5 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments☆103May 29, 2020Updated 5 years ago
- ipython notebooks for feature extraction and training of audio event classifier on ESC-50 dataset.☆10Mar 16, 2018Updated 8 years ago
- Tool for creation, manipulation and maintenance of voice corpora☆82May 3, 2024Updated last year
- Wrapper for the yr.no weather service API.☆15Apr 12, 2018Updated 7 years ago
- The code enables users to use Mozilla's Deep Speech model over the Web Browser.☆32Jan 4, 2023Updated 3 years ago
- Windows 💻 RobustVideoMatting with ONNXRuntime/MNN/TNN C++/Python☆12Mar 10, 2022Updated 4 years ago
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- A small package for handy conversion of german numerals (also ordinal / signed) written as words to numbers.☆12Jan 22, 2026Updated 2 months ago
- Android app that allows device discovery on WLAN (w/ Bonjour) and video calls to be placed between devices on WLAN (w/ WebRTC) without an…☆21Feb 19, 2026Updated last month
- edge/mobile transformer based Vision DNN inference benchmark☆16Aug 29, 2025Updated 6 months ago
- Constrained Manipulability is a library used to compute and visualize a robot's capacities in constrained environments.☆14Apr 23, 2025Updated 11 months ago
- Google's TPGST reimplementation.☆34Dec 11, 2019Updated 6 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Highly encrypted open source Chat and VOIP app with Firebase, supporting conference and p2p calls with jitsi☆11Feb 2, 2023Updated 3 years ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.☆214Jul 25, 2024Updated last year
- Noisy Quantum Gates model for simulating the noise of quantum devices.☆19Mar 17, 2026Updated last week
- A very tiny python api for the stock exchange tradegate.de☆14Jan 20, 2022Updated 4 years ago
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant☆10Aug 12, 2019Updated 6 years ago
- [DEPRECATED] Sandboxing plugin for launch_ros☆15Jan 28, 2022Updated 4 years ago
- Devbox: Prepare your python development environment -Neovim with kickstarter.nvim☆16May 28, 2023Updated 2 years ago
- Tensorflow training scripts for depthwise separable convolutional neural networks for keyword spotting, and C++ code for deployment.☆41Apr 2, 2020Updated 5 years ago
- ☆13Jan 8, 2024Updated 2 years ago
- Read audio with FFmpeg into NumPy/PyTorch via ctypes (standard library module)☆11Aug 12, 2020Updated 5 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- A study of the downstream instability of word embeddings☆12Aug 23, 2022Updated 3 years ago
- Acoustic echo cancelation(AEC) is a main algorithm in the pipe line of acoustic devices with KWS or ASR. FNLMS is used.☆19Apr 22, 2019Updated 6 years ago
- AR.Drone 2.0 human tracking with Mobilenet-SSD and PID control☆17Sep 19, 2019Updated 6 years ago
- Unsupervised spoken sentence embeddings☆14Dec 14, 2022Updated 3 years ago
- a cross-platform audio engine,including audio device ,audio processing audio codec,etc.☆14Nov 18, 2017Updated 8 years ago