common-voice / common-voice-bundler
Script for bundling Common Voice (https://commonvoice.mozilla.org/) clips by language
☆10Updated 2 years ago
Alternatives and similar repositories for common-voice-bundler:
Users that are interested in common-voice-bundler are comparing it to the libraries listed below
- Metadata and versioning details for the Common Voice dataset☆146Updated last month
- Command line tool to create corpora for Common Voice☆75Updated 10 months ago
- Scraping Wikipedia for fair use sentences☆53Updated last year
- Tool to collect and review sentences for Common Voice☆81Updated last year
- Mozilla Voice Community Playbook☆45Updated 11 months ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.☆205Updated 9 months ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments☆102Updated 4 years ago
- Program to benchmark various speech recognition APIs☆80Updated 5 years ago
- Python library for handling audio datasets.☆137Updated last year
- 🐸STT integration examples☆127Updated 2 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages.☆310Updated 5 months ago
- DeepSpeech based forced alignment tool☆237Updated 4 years ago
- A crash course for training speech recognition models using DeepSpeech.☆25Updated 3 years ago
- Port of the OpenFST library to Windows☆73Updated last year
- Mycroft's multilingual text parsing and formatting library☆76Updated last year
- End-to-end speech recognition using RNN Transducers in Tensorflow 2.0☆244Updated 4 years ago
- Working online speech recognition based on RNN Transducer. ( Trained model release available in release )☆293Updated 3 years ago
- Identifying people from small audio fragments☆170Updated 5 years ago
- dataset for lightly supervised training using the librivox audio book recordings. https://librivox.org/.☆494Updated last year
- Tool for creation, manipulation and maintenance of voice corpora☆81Updated 11 months ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stack☆26Updated 2 years ago
- Scripts to simplify data prepping for Mozilla DeepSpeech.☆14Updated 5 years ago
- Linguistic processing for Common Voice☆55Updated last year
- A collection of useful tools for handling speech recognition data☆30Updated 2 years ago
- GStreamer plugin around Kaldi's online neural network decoder☆185Updated 4 years ago
- Phonetisaurus G2P☆471Updated 10 months ago
- A list of publically available audio data that anyone can download for ASR or other speech activities☆207Updated 3 years ago
- Automatically constructing corpus for automatic speech recognition from YouTube videos☆154Updated 5 years ago
- PyTorch implementations of neural network models for keyword spotting☆515Updated last year
- Speaker diarization python system based on binary key speaker modelling☆61Updated 3 years ago