common-voice / sentence-collectorLinks
Tool to collect and review sentences for Common Voice
☆81Updated 2 years ago
Alternatives and similar repositories for sentence-collector
Users that are interested in sentence-collector are comparing it to the libraries listed below
Sorting:
- Scraping Wikipedia for fair use sentences☆54Updated last year
- Mozilla Voice Community Playbook☆48Updated last year
- Firefox Voice is an experiment in a voice-controlled web user agent☆290Updated 4 years ago
- Crawler for linguistic corpora☆210Updated 3 months ago
- Wikidata lexemes presentations☆23Updated 7 months ago
- The code, training pipeline, and models that power Firefox Translations☆217Updated last week
- The Open Virtual Assistant☆56Updated 4 years ago
- Command line tool to create corpora for Common Voice☆78Updated last year
- A cloud-based, open-source system for writing and publishing dictionaries.☆96Updated last year
- A code for transliterating (romanizing) Arabic text using the American Library Association - Library of Congress (ALA-LC) standard☆50Updated 3 years ago
- Tooling for producing French dataset for Common Voice☆101Updated 10 months ago
- Mycroft's multilingual text parsing and formatting library☆78Updated 2 years ago
- The Global WordNet Association Collaborative Inter-Lingual Index☆47Updated last year
- An algorithm to compute token-level provenance and changes for Wiki revisioned content. Tested at +95% accuracy for EN.Wikipedia.☆32Updated 6 years ago
- Listening-based language learning☆67Updated last year
- Wiktionary parser tool for many language editions.☆54Updated 3 years ago
- The daily list of Wikipedia's most-visited articles☆33Updated 2 months ago
- Collaborative data curation for Glottolog☆177Updated last month
- The repo for the PetScan tool☆57Updated last month
- Mycroft.AI documentation for all public facing technical components.☆78Updated 2 years ago
- An advanced, extensible web front-end for the Manatee-open corpus search engine☆78Updated last week
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆35Updated 2 years ago
- The Unicode Cookbook for Linguists☆56Updated 5 years ago
- A webpage and API for using Mozilla DeepSpeech☆48Updated 4 years ago
- A living document for all things Common Voice.☆14Updated last year
- Lexical data at Unicode☆70Updated last year
- A crash course for training speech recognition models using DeepSpeech.☆25Updated 4 years ago
- 🌻 MediaWiki extension allowing mass recording of clean, well cut, well named pronunciation files.☆16Updated this week
- Scripts for training Kaldi for German speech recognition (ASR).☆26Updated 4 years ago
- MediaWiki extension to handle multilingual abstract content☆78Updated last year