common-voice / cv-sentence-extractor
Scraping Wikipedia for fair use sentences
☆54 · Updated last year
Alternatives and similar repositories for cv-sentence-extractor
Users who are interested in cv-sentence-extractor are comparing it to the repositories listed below.
- Tool to collect and review sentences for Common Voice ☆81 · Updated 2 years ago
- Command line tool to create corpora for Common Voice ☆76 · Updated 11 months ago
- Convert Arpabet to IPA. Arpabet is the set of phonemes used by the CMU Pronouncing Dictionary. IPA is the International Phonetic Alphabet… ☆44 · Updated 4 years ago
- Efficient teacher-student models and scripts to make them ☆50 · Updated last year
- Unicode Standard tokenization routines and orthography profile segmentation ☆37 · Updated 2 months ago
- 🙊 software for creating speech recognition models. ☆159 · Updated 11 months ago
- Linguistic processing for Common Voice ☆55 · Updated last year
- Open Source AI Benchmarking toolkit for benchmarking speech to text services ☆55 · Updated last year
- Metadata and versioning details for the Common Voice dataset ☆146 · Updated last month
- The Unicode Cookbook for Linguists ☆54 · Updated 4 years ago
- 🐸TTS recipes for different datasets ☆87 · Updated 2 years ago
- Massively multilingual pronunciation mining ☆340 · Updated 3 weeks ago
- Universal Romanizer that can convert any unicode script to roman (latin) script ☆197 · Updated 9 months ago
- Crawler for linguistic corpora ☆204 · Updated last year
- PocketSphinx phonetic feature extraction for intelligibility prediction and remediation ☆29 · Updated 4 years ago
- 24-hour Automatic Speech Recognition ☆27 · Updated 3 years ago
- A crash course for training speech recognition models using DeepSpeech. ☆25 · Updated 4 years ago
- Mozilla Voice Community Playbook ☆46 · Updated 11 months ago
- Coqui Inference Engine ☆40 · Updated 3 years ago
- Praaline is an open-source system to manage, annotate, visualise and analyse spoken language corpora ☆29 · Updated 2 years ago
- Script for bundling Common Voice (https://commonvoice.mozilla.org/) clips by language ☆11 · Updated 2 years ago
- The CMU Pronouncing Dictionary converted to IPA ☆82 · Updated 5 years ago
- CMU Wilderness Multilingual Speech Dataset ☆280 · Updated 6 years ago
- A guide to building language technology in new languages. ☆58 · Updated 3 years ago
- ☆36 · Updated 10 months ago
- Convert native orthographies to the International Phonetic Alphabet ☆14 · Updated 2 years ago
- Python module for syllabifying English ARPABET transcriptions ☆66 · Updated 6 years ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory ☆16 · Updated 6 years ago
- ipapy is a Python module to work with International Phonetic Alphabet (IPA) strings ☆85 · Updated last year
- Scripts to simplify data prepping for Mozilla DeepSpeech. ☆14 · Updated 5 years ago