common-voice / sentence-collector
Tool to collect and review sentences for Common Voice
☆81 · Updated 2 years ago
Alternatives and similar repositories for sentence-collector
Users who are interested in sentence-collector are comparing it to the repositories listed below
- Scraping Wikipedia for fair use sentences ☆54 · Updated last year
- All Apertium language pairs, modules, tools and core ☆70 · Updated 4 years ago
- The code, training pipeline, and models that power Firefox Translations ☆198 · Updated this week
- Firefox Voice is an experiment in a voice-controlled web user agent ☆290 · Updated 4 years ago
- Mozilla Voice Community Playbook ☆47 · Updated last year
- Mycroft.AI documentation for all public facing technical components. ☆78 · Updated 2 years ago
- The Open Virtual Assistant ☆56 · Updated 4 years ago
- A code for transliterating (romanizing) Arabic text using the American Library Association - Library of Congress (ALA-LC) standard ☆48 · Updated 3 years ago
- Tooling for producing French dataset for Common Voice ☆101 · Updated 6 months ago
- Crawler for linguistic corpora ☆205 · Updated last year
- Mycroft's multilingual text parsing and formatting library ☆77 · Updated last year
- Command line tool to create corpora for Common Voice ☆78 · Updated last year
- A web framework to display Cross Linguistic Linked Data. ☆57 · Updated 5 months ago
- Android software for recording and translation ☆30 · Updated 9 years ago
- The Unicode Cookbook for Linguists ☆56 · Updated 4 years ago
- The repo for the PetScan tool ☆55 · Updated last week
- A cloud-based, open-source system for writing and publishing dictionaries. ☆93 · Updated last year
- Wikidata lexemes presentations ☆23 · Updated 4 months ago
- A radio for Wikimedia Commons audio files ☆14 · Updated 4 years ago
- A Docker image for a relatively light-weight full Arabic speech synthesis system ☆32 · Updated 4 years ago
- A crash course for training speech recognition models using DeepSpeech. ☆25 · Updated 4 years ago
- Repository of "CV Project" app. It's an unofficial app for Mozilla Common Voice, which permits you to contribute to this project via your… ☆110 · Updated 2 months ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages. ☆34 · Updated 2 years ago
- The Global WordNet Association Collaborative Inter-Lingual Index ☆44 · Updated 9 months ago
- 🙊 software for creating speech recognition models. ☆159 · Updated last year
- Tool to import files from the Internet Archive to Wikimedia Commons. ☆17 · Updated last week
- 🌻 MediaWiki extension allowing mass recording of clean, well cut, well named pronunciation files. ☆16 · Updated last week
- 🌐 The knowledge base software that drives Wikidata.org. Mirror from https://gerrit.wikimedia.org/g/mediawiki/extensions/Wikibase. See ht… ☆133 · Updated this week
- Transfer video and audio from external sites to Commons. ☆47 · Updated 2 weeks ago
- An algorithm to compute token-level provenance and changes for Wiki revisioned content. Tested at +95% accuracy for EN.Wikipedia. ☆31 · Updated 6 years ago