common-voice / cv-sentence-extractorLinks
Scraping Wikipedia for fair use sentences
☆54Updated last year
Alternatives and similar repositories for cv-sentence-extractor
Users that are interested in cv-sentence-extractor are comparing it to the libraries listed below
Sorting:
- Tool to collect and review sentences for Common Voice☆81Updated 2 years ago
- The Unicode Cookbook for Linguists☆54Updated 4 years ago
- Command line tool to create corpora for Common Voice☆76Updated last year
- Python library for handling audio datasets.☆138Updated last year
- Labeled data for homograph disambiguation☆57Updated 2 years ago
- Linguistic processing for Common Voice☆55Updated last year
- Universal Romanizer that can convert any unicode script to roman (latin) script☆204Updated 10 months ago
- Facebook AI Research Automatic Speech Recognition Toolkit☆23Updated 4 years ago
- 🐸TTS recipes for different datasets☆87Updated 2 years ago
- A versioned python wrapper package for cmudict (https://github.com/cmusphinx/cmudict).☆63Updated last month
- Forced Alignments for Common Voice☆31Updated 4 years ago
- Unicode Standard tokenization routines and orthography profile segmentation☆37Updated 3 months ago
- ☆43Updated 7 years ago
- Helsinki Finite-State Technology (library and application suite)☆130Updated last week
- ipapy is a Python module to work with International Phonetic Alphabet (IPA) strings☆86Updated last year
- Massively multilingual pronunciation mining☆340Updated 2 weeks ago
- 🙊 software for creating speech recognition models.☆159Updated last year
- A crash course for training speech recognition models using DeepSpeech.☆25Updated 4 years ago
- Metadata and versioning details for the Common Voice dataset☆148Updated 2 months ago
- Coqui Inference Engine☆40Updated 3 years ago
- A tool for automatic phoneme transcription☆157Updated 2 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages.☆315Updated 6 months ago
- Data and code for grapheme-to-phoneme transducers in lots of languages☆137Updated last year
- Convert Arpabet to IPA. Arpabet is the set of phonemes used by the CMU Pronouncing Dictionary. IPA is the International Phonetic Alphabet…☆43Updated 4 years ago
- ☆22Updated 3 years ago
- Open Source AI Benchmarking toolkit for benchmarking speech to text services☆55Updated last year
- 🌻 MediaWiki extension allowing mass recording of clean, well cut, well named pronunciation files.☆16Updated 3 weeks ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆47Updated 2 years ago
- A database of number names for 186 languages, locales, and scripts☆67Updated 2 years ago
- Python module for syllabifying English ARPABET transcriptions☆66Updated 6 years ago