common-voice / common-voice-bundlerLinks
Script for bundling Common Voice (https://commonvoice.mozilla.org/) clips by language
☆11Updated 2 years ago
Alternatives and similar repositories for common-voice-bundler
Users that are interested in common-voice-bundler are comparing it to the libraries listed below
Sorting:
- Command line tool to create corpora for Common Voice☆77Updated last year
- Scraping Wikipedia for fair use sentences☆54Updated last year
- Metadata and versioning details for the Common Voice dataset☆148Updated this week
- Tool to collect and review sentences for Common Voice☆81Updated 2 years ago
- Python library for handling audio datasets.☆138Updated last year
- BurrMill core☆21Updated 3 years ago
- Scripts to simplify data prepping for Mozilla DeepSpeech.☆14Updated 5 years ago
- Scripts for training Kaldi for German speech recognition (ASR).☆24Updated 4 years ago
- Facebook AI Research Automatic Speech Recognition Toolkit☆23Updated 4 years ago
- Official home of the Idlak Speech Synthesis Toolkit☆66Updated 3 years ago
- Automatically constructing corpus for automatic speech recognition from YouTube videos☆154Updated 5 years ago
- Coqui Inference Engine☆40Updated 3 years ago
- Crawling and creating a German language model resource☆19Updated 2 years ago
- DeepSpeech based forced alignment tool☆237Updated 4 years ago
- Linguistic processing for Common Voice☆55Updated last year
- 🐸STT integration examples☆129Updated 2 years ago
- Dockerfile for compiling Kaldi for Android.☆66Updated 6 years ago
- Repository for the web pages and scripts associated with OpenSLR: the open speech and language repository☆25Updated 4 years ago
- Small language toolkit for creation, interpolation and pruning of ARPA language models☆92Updated 2 years ago
- A crash course for training speech recognition models using DeepSpeech.☆25Updated 4 years ago
- Port of the OpenFST library to Windows☆77Updated last year
- Adnabod lleferydd Cymraeg i'r Gymraeg gyda HuggingFace // Speech Recognition for Welsh with HuggingFace☆14Updated 2 years ago
- A modification of https://github.com/Rayhane-mamah/Tacotron-2 that is intended for use with the Swedish language.☆9Updated 6 years ago
- A list of publically available audio data that anyone can download for ASR or other speech activities☆209Updated 3 years ago
- 🐸TTS recipes for different datasets☆87Updated 2 years ago
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus☆174Updated 6 months ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆41Updated 2 years ago
- Mozilla Voice Community Playbook☆46Updated last year
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆91Updated 4 months ago
- 🙊 software for creating speech recognition models.☆159Updated last year