common-voice / common-voice-bundler
Script for bundling Common Voice (https://commonvoice.mozilla.org/) clips by language
☆11Updated 2 years ago
Alternatives and similar repositories for common-voice-bundler
Users that are interested in common-voice-bundler are comparing it to the libraries listed below
Sorting:
- Command line tool to create corpora for Common Voice☆76Updated 11 months ago
- Metadata and versioning details for the Common Voice dataset☆146Updated last month
- Tool to collect and review sentences for Common Voice☆81Updated 2 years ago
- 🙊 software for creating speech recognition models.☆159Updated 11 months ago
- Scripts to simplify data prepping for Mozilla DeepSpeech.☆14Updated 5 years ago
- Scraping Wikipedia for fair use sentences☆54Updated last year
- BurrMill core☆21Updated 3 years ago
- A crash course for training speech recognition models using DeepSpeech.☆25Updated 4 years ago
- Official home of the Idlak Speech Synthesis Toolkit☆66Updated 3 years ago
- automatically align transcribed audio and generate a wav2letter training corpus☆36Updated 2 years ago
- Port of the OpenFST library to Windows☆73Updated last year
- Linguistic processing for Common Voice☆55Updated last year
- Praaline is an open-source system to manage, annotate, visualise and analyse spoken language corpora☆29Updated 2 years ago
- Crawling and creating a German language model resource☆19Updated 2 years ago
- 🐸TTS recipes for different datasets☆87Updated 2 years ago
- DeepSpeech based forced alignment tool☆237Updated 4 years ago
- Speaker diarization python system based on binary key speaker modelling☆61Updated 3 years ago
- C++ Implementation of the Information Bottleneck System☆23Updated 6 years ago
- A suite of speech signal processing tools☆233Updated this week
- Universal Romanizer that can convert any unicode script to roman (latin) script☆197Updated 9 months ago
- Scripts for training Kaldi for German speech recognition (ASR).☆24Updated 4 years ago
- Coqui Inference Engine☆40Updated 3 years ago
- Python library for handling audio datasets.☆138Updated last year
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- C++ library for converting text to phonemes for Piper☆118Updated last year
- Program to benchmark various speech recognition APIs☆80Updated 5 years ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.☆170Updated 4 years ago
- This repository is a collection of TTS Models in TFLite☆192Updated 4 years ago
- 🐸STT integration examples☆126Updated 2 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆102Updated 2 years ago