Scripts to simplify data prepping for Mozilla DeepSpeech.
☆14Aug 6, 2019Updated 6 years ago
Alternatives and similar repositories for deepspeech-tools
Users that are interested in deepspeech-tools are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Script to train a German n-gram Language Model on articles of Wikipedia☆14Oct 20, 2018Updated 7 years ago
- Crawling and creating a German language model resource☆18Aug 23, 2022Updated 3 years ago
- Python package for the Zero Speech Challenge 2020☆14Feb 5, 2021Updated 5 years ago
- A very basic demonstration connecting speech recognition and text-to-speech☆20May 3, 2020Updated 5 years ago
- A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)☆17Apr 2, 2018Updated 8 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory☆16Mar 18, 2019Updated 7 years ago
- A webpage and API for using Mozilla DeepSpeech☆48Feb 24, 2021Updated 5 years ago
- Coqui Inference Engine☆41Aug 3, 2021Updated 4 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments☆103May 29, 2020Updated 5 years ago
- ipython notebooks for feature extraction and training of audio event classifier on ESC-50 dataset.☆10Mar 16, 2018Updated 8 years ago
- AVPipe :-)☆12Jul 16, 2021Updated 4 years ago
- Tool for creation, manipulation and maintenance of voice corpora☆82May 3, 2024Updated last year
- DeepSpeech based forced alignment tool☆239Dec 12, 2020Updated 5 years ago
- Wrapper for the yr.no weather service API.☆15Apr 12, 2018Updated 8 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- The code enables users to use Mozilla's Deep Speech model over the Web Browser.☆32Jan 4, 2023Updated 3 years ago
- Windows 💻 RobustVideoMatting with ONNXRuntime/MNN/TNN C++/Python☆12Mar 10, 2022Updated 4 years ago
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- A small package for handy conversion of german numerals (also ordinal / signed) written as words to numbers.☆12Jan 22, 2026Updated 2 months ago
- Keyword Spotting using BCResNet and Arcface Loss☆11Jan 28, 2022Updated 4 years ago
- ☆12Nov 6, 2015Updated 10 years ago
- ncnn & tnn & mnn 三合一的安卓 Camera & Gallery 工程☆14Jul 22, 2022Updated 3 years ago
- Android app that allows device discovery on WLAN (w/ Bonjour) and video calls to be placed between devices on WLAN (w/ WebRTC) without an…☆24Apr 1, 2026Updated last week
- Google's TPGST reimplementation.☆34Dec 11, 2019Updated 6 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.☆214Jul 25, 2024Updated last year
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 5 years ago
- Open Source AI Benchmarking toolkit for benchmarking speech to text services☆59Apr 17, 2024Updated last year
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant☆10Aug 12, 2019Updated 6 years ago
- An audio steganalysis method based on CNN in the time domain.☆12Feb 25, 2021Updated 5 years ago
- Adnabod lleferydd Cymraeg i'r Gymraeg gyda HuggingFace // Speech Recognition for Welsh with HuggingFace☆13Nov 29, 2022Updated 3 years ago
- ☆13Jan 8, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Read audio with FFmpeg into NumPy/PyTorch via ctypes (standard library module)☆11Aug 12, 2020Updated 5 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 4 years ago
- A study of the downstream instability of word embeddings☆12Aug 23, 2022Updated 3 years ago
- ☆15Nov 1, 2018Updated 7 years ago
- Acoustic echo cancelation(AEC) is a main algorithm in the pipe line of acoustic devices with KWS or ASR. FNLMS is used.☆19Apr 22, 2019Updated 6 years ago
- Web service for the OpenPLZ API project☆35Feb 12, 2026Updated 2 months ago
- Unsupervised spoken sentence embeddings☆14Dec 14, 2022Updated 3 years ago