tatuylonen / wikitextprocessorLinks
Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. For data extraction, bulk syntax checking, error detection, and offline formatting.
☆107Updated this week
Alternatives and similar repositories for wikitextprocessor
Users that are interested in wikitextprocessor are comparing it to the libraries listed below
Sorting:
- A modern, interlingual wordnet interface for Python☆257Updated last month
- A Python library to parse MediaWiki WikiText☆311Updated 3 months ago
- This packages up data for the Open Multilingual Wordnet☆52Updated 2 months ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆170Updated 2 months ago
- A Python Wiktionary Parser☆363Updated last month
- The Global WordNet Association Collaborative Inter-Lingual Index☆45Updated 9 months ago
- The Open Multilingual Wordnet☆63Updated last year
- Python tools for interacting with Wikidata☆154Updated last year
- Compute PageRank on >3 billion Wikipedia links on off-the-shelf hardware.☆60Updated 9 months ago
- The Open English WordNet☆606Updated this week
- Machine-readable Wiktionary☆77Updated last year
- A list of vocabulary lists☆22Updated 5 years ago
- An advanced, extensible web front-end for the Manatee-open corpus search engine☆73Updated last week
- Wiktionary parser tool for many language editions.☆54Updated 3 years ago
- Tools for scraping, annotating, and parsing morphological information from Wiktionary☆15Updated 5 years ago
- Master repo for the UniMorph project, includes the UniMorph schema and annotated data files☆32Updated 5 years ago
- A cloud-based, open-source system for writing and publishing dictionaries.☆93Updated last year
- Filter and format a newline-delimited JSON stream of Wikibase entities☆98Updated 2 months ago
- Faster, modernized fork of the language identification tool langid.py☆56Updated 9 months ago
- University of Colorado VerbNet☆110Updated last year
- Pipeline to generate the Standardized Project Gutenberg Corpus☆195Updated last year
- CLI for loading Wikidata subsets (or all of it) into Elasticsearch☆70Updated 3 years ago
- Morphological Dictionaries for German Language☆29Updated 7 years ago
- Sentence aligner☆116Updated 4 years ago
- A collection of open source tools and resources related to Wikibase knowledge graphs☆72Updated last year
- Entity linking system for Wikidata updated by your edits in real time☆257Updated 8 months ago
- A python module for English lemmatization and inflection.☆270Updated last year
- Multi Tier Annotation Search☆26Updated 4 years ago
- ☆74Updated last week
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆30Updated last month