common-voice / common-voice-bundler
Script for bundling Common Voice (https://commonvoice.mozilla.org/) clips by language
☆10 · Updated last year
Alternatives and similar repositories for common-voice-bundler:
Users who are interested in common-voice-bundler compare it to the libraries listed below.
- Metadata and versioning details for the Common Voice dataset ☆146 · Updated last week
- Command line tool to create corpora for Common Voice ☆75 · Updated 10 months ago
- Program to benchmark various speech recognition APIs ☆80 · Updated 5 years ago
- Scraping Wikipedia for fair use sentences ☆53 · Updated last year
- 🐸STT integration examples ☆127 · Updated 2 years ago
- Official home of the Idlak Speech Synthesis Toolkit ☆66 · Updated 3 years ago
- Tool to collect and review sentences for Common Voice ☆81 · Updated last year
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi. ☆22 · Updated 5 years ago
- C++ Implementation of the Information Bottleneck System ☆23 · Updated 6 years ago
- Linguistic processing for Common Voice ☆55 · Updated last year
- automatically align transcribed audio and generate a wav2letter training corpus ☆36 · Updated last year
- BurrMill core ☆21 · Updated 3 years ago
- Server framework for Kaldi ASR Toolkit ☆98 · Updated last year
- Edinburgh Speech Tools ☆58 · Updated last year
- 🙊 software for creating speech recognition models. ☆158 · Updated 9 months ago
- Port of the OpenFST library to Windows ☆71 · Updated 11 months ago
- Scripts for training Kaldi for German speech recognition (ASR). ☆24 · Updated 4 years ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice. ☆204 · Updated 8 months ago
- ☆39 · Updated last year
- Speaker diarization python system based on binary key speaker modelling ☆61 · Updated 3 years ago
- This is a github repository of the abandonware Sequitur G2P by Bisani & Ney ☆161 · Updated 8 months ago
- Praaline is an open-source system to manage, annotate, visualise and analyse spoken language corpora ☆28 · Updated 2 years ago
- Facebook AI Research Automatic Speech Recognition Toolkit ☆23 · Updated 4 years ago
- Tool for creation, manipulation and maintenance of voice corpora ☆81 · Updated 10 months ago
- Crawling and creating a German language model resource ☆19 · Updated 2 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages. ☆308 · Updated 4 months ago
- A collection of basic python modules for spoken natural language processing ☆56 · Updated 5 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆101 · Updated 2 years ago
- Working online speech recognition based on RNN Transducer. (Trained model release available in release) ☆292 · Updated 3 years ago
- Scripts for training general-purpose large vocabulary German acoustic models for ASR with Kaldi. ☆173 · Updated last year