common-voice / sentence-collector
Tool to collect and review sentences for Common Voice
☆81Updated last year
Alternatives and similar repositories for sentence-collector:
Users that are interested in sentence-collector are comparing it to the libraries listed below
- Scraping Wikipedia for fair use sentences☆53Updated last year
- Mozilla Voice Community Playbook☆45Updated 11 months ago
- 🌻 MediaWiki extension allowing mass recording of clean, well cut, well named pronunciation files.☆16Updated 2 months ago
- Wikidata lexemes presentations☆23Updated 2 weeks ago
- A radio for Wikimedia Commons audio files☆14Updated 4 years ago
- An algorithm to compute token-level provenance and changes for Wiki revisioned content. Tested at +95% accuracy for EN.Wikipedia.☆31Updated 6 years ago
- Metadata and versioning details for the Common Voice dataset☆146Updated last month
- The Open Virtual Assistant☆56Updated 3 years ago
- Crawler for linguistic corpora☆204Updated last year
- automate incrementally producing word pronunciation recordings for Wiktionary through Wikimedia Commons☆22Updated 7 years ago
- Generation of bilingual dictionaries from Wiktionary/dbnary data for the WikDict project☆49Updated 5 months ago
- This is the repo that hosts the code for Mozilla's translation service☆25Updated last year
- The repo for the PetScan tool☆50Updated last month
- Command line tool to create corpora for Common Voice☆75Updated 10 months ago
- The Global WordNet Association Collaborative Inter-Lingual Index☆42Updated 5 months ago
- A crash course for training speech recognition models using DeepSpeech.☆25Updated 3 years ago
- Mycroft's multilingual text parsing and formatting library☆76Updated last year
- All Apertium language pairs, modules, tools and core☆70Updated 3 years ago
- Wikidata + GraphQL (Dream API for everything)☆46Updated 2 years ago
- Tool to add {{Location}} or {{Object location}} to images on Wikimedia Commons☆29Updated this week
- Website and documentation☆20Updated 4 months ago
- A webpage and API for using Mozilla DeepSpeech☆47Updated 4 years ago
- Updates Wikidata entries using metadata from github☆44Updated 3 weeks ago
- Script for bundling Common Voice (https://commonvoice.mozilla.org/) clips by language☆10Updated 2 years ago
- Ontologies of Linguistic Annotation. Machine-readable tagsets and annotation schemata for more than 100 languages.☆20Updated 4 months ago
- A web framework to display Cross Linguistic Linked Data.☆56Updated 2 months ago
- New (2019) version of the SVG Translate tool, re-built by the WMF Community Tech team.☆17Updated last week
- Global Consent Manager Project☆16Updated last year
- Tool to import files from the Internet Archive to Wikimedia Commons.☆17Updated 2 months ago
- Linguistic processing for Common Voice☆55Updated last year