common-voice / sentence-collector
Tool to collect and review sentences for Common Voice
☆81Updated last year
Alternatives and similar repositories for sentence-collector:
Users that are interested in sentence-collector are comparing it to the libraries listed below
- Scraping Wikipedia for fair use sentences☆53Updated last year
- Wikidata lexemes presentations☆23Updated 2 weeks ago
- Mycroft's multilingual text parsing and formatting library☆76Updated last year
- Listening-based language learning☆53Updated last year
- Command line tool to create corpora for Common Voice☆75Updated 9 months ago
- Crawler for linguistic corpora☆204Updated last year
- Generation of bilingual dictionaries from Wiktionary/dbnary data for the WikDict project☆49Updated 4 months ago
- Mozilla Voice Community Playbook☆45Updated 10 months ago
- SuggestBot is an article recommender for Wikipedia☆21Updated 2 months ago
- A radio for Wikimedia Commons audio files☆14Updated 4 years ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory☆16Updated 6 years ago
- Updates Wikidata entries using metadata from github☆45Updated 2 months ago
- The Open Virtual Assistant☆56Updated 3 years ago
- Lexical data at Unicode☆68Updated 6 months ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stack☆26Updated 2 years ago
- Tool for creation, manipulation and maintenance of voice corpora☆81Updated 10 months ago
- German lemmatization with IWNLP as extension for spaCy☆24Updated last year
- A code for transliterating (romanizing) Arabic text using the American Library Association - Library of Congress (ALA-LC) standard☆45Updated 2 years ago
- All Apertium language pairs, modules, tools and core☆70Updated 3 years ago
- Linguistic processing for Common Voice☆55Updated last year
- Global ASP - African Storybook Project for the World☆14Updated 4 months ago
- Contributors building the Mycroft open source project☆23Updated 2 years ago
- cookiecutter template for Wikimedia Toolforge tools using Flask☆22Updated last month
- Web hub based on Wikidata☆36Updated 2 years ago
- Script for bundling Common Voice (https://commonvoice.mozilla.org/) clips by language☆10Updated last year
- The kinyarwanda model for deepspeech☆15Updated 3 years ago
- Android software for recording and translation☆30Updated 9 years ago
- 🐸TTS recipes for different datasets☆86Updated 2 years ago
- Tooling for producing French dataset for Common Voice☆101Updated 2 months ago
- Wikidata + GraphQL (Dream API for everything)☆46Updated 2 years ago