common-voice / common-voice-bundlerLinks
Script for bundling Common Voice (https://commonvoice.mozilla.org/) clips by language
☆11Updated 2 years ago
Alternatives and similar repositories for common-voice-bundler
Users that are interested in common-voice-bundler are comparing it to the libraries listed below
Sorting:
- Command line tool to create corpora for Common Voice☆78Updated last year
- Metadata and versioning details for the Common Voice dataset☆152Updated 2 months ago
- Tool to collect and review sentences for Common Voice☆81Updated 2 years ago
- C++ Implementation of the Information Bottleneck System☆23Updated 6 years ago
- 🐸TTS recipes for different datasets☆86Updated 3 years ago
- Official home of the Idlak Speech Synthesis Toolkit☆66Updated 3 years ago
- SEPIA server to support open-source speech recognition via WebSocket connection.☆128Updated 9 months ago
- A tokenizer, text cleaner, and phonemizer for many human languages.☆324Updated 9 months ago
- Program to benchmark various speech recognition APIs☆80Updated 5 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆101Updated 2 years ago
- Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time☆344Updated this week
- 🙊 software for creating speech recognition models.☆159Updated last year
- Mozilla Voice Community Playbook☆47Updated last year
- 🐸STT integration examples☆129Updated 2 years ago
- Port of the OpenFST library to Windows☆79Updated last year
- Facebook AI Research Automatic Speech Recognition Toolkit☆23Updated 4 years ago
- dataset for lightly supervised training using the librivox audio book recordings. https://librivox.org/.☆508Updated 2 years ago
- Coqui Inference Engine☆41Updated 4 years ago
- Scripts to simplify data prepping for Mozilla DeepSpeech.☆14Updated 6 years ago
- Adnabod lleferydd Cymraeg i'r Gymraeg gyda HuggingFace // Speech Recognition for Welsh with HuggingFace☆14Updated 2 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments☆102Updated 5 years ago
- Gecko - A Tool for Effective Annotation of Human Conversations☆295Updated 2 years ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.☆213Updated last year
- Scripts for training general-purpose large vocabulary German acoustic models for ASR with Kaldi.☆173Updated 2 years ago
- Dockerfile for compiling Kaldi for Android.☆66Updated 6 years ago
- Edinburgh Speech Tools☆60Updated 2 years ago
- Working online speech recognition based on RNN Transducer. ( Trained model release available in release )☆293Updated 4 years ago
- Datasets and tools for basic natural language processing.☆386Updated 3 years ago
- This is a github repository of the abandonware Sequitur G2P by Bisani & Ney☆170Updated last year
- Scripts for training Kaldi for German speech recognition (ASR).☆24Updated 4 years ago