tatuylonen / wikitextprocessor
Python package for Wikimedia dump processing (Wiktionary, Wikipedia, etc.). Wikitext parsing, template expansion, Lua module execution. For data extraction, bulk syntax checking, error detection, and offline formatting.
☆108 Updated 2 weeks ago
Alternatives and similar repositories for wikitextprocessor
Users who are interested in wikitextprocessor are comparing it to the libraries listed below.
- A modern, interlingual wordnet interface for Python ☆276 Updated this week
- A Python library to parse MediaWiki WikiText ☆317 Updated 6 months ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency ☆180 Updated 6 months ago
- A Python Wiktionary Parser ☆367 Updated 4 months ago
- This packages up data for the Open Multilingual Wordnet ☆58 Updated 6 months ago
- Master repo for the UniMorph project, includes the UniMorph schema and annotated data files ☆33 Updated 6 years ago
- The Open Multilingual Wordnet ☆66 Updated last year
- The Global WordNet Association Collaborative Inter-Lingual Index ☆49 Updated last year
- Wiktionary dump file parser and multilingual data extractor ☆1,050 Updated this week
- A Python module for English lemmatization and inflection. ☆274 Updated 2 years ago
- ☆79 Updated this week
- The World Atlas of Language Structures ☆72 Updated last year
- Wiktionary parser tool for many language editions. ☆54 Updated 3 years ago
- The Open English WordNet ☆673 Updated this week
- Aksharamukha Python Library ☆55 Updated 10 months ago
- A cloud-based, open-source system for writing and publishing dictionaries. ☆97 Updated last year
- A list of vocabulary lists ☆22 Updated 5 years ago
- Morphological Dictionaries for German Language ☆30 Updated 7 years ago
- Machine-readable Wiktionary ☆77 Updated last year
- Text to sentence splitter using a heuristic algorithm by Philipp Koehn and Josh Schroeder. ☆255 Updated 3 years ago
- Python Finite-State Toolkit ☆60 Updated 2 weeks ago
- University of Colorado VerbNet ☆115 Updated last year
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation) ☆52 Updated 2 years ago
- Gather modern English word frequencies from all enwiki articles. ☆227 Updated last year
- Offline bilingual dictionaries made using data from Wiktionary ☆62 Updated 10 years ago
- A multilingual parallel corpus created from translations of the Bible. ☆191 Updated 6 months ago
- Sentence aligner ☆121 Updated 4 years ago
- Collaborative data curation for Glottolog ☆177 Updated last week
- Python tools for interacting with Wikidata ☆158 Updated 2 years ago
- Bitextor generates translation memories from multilingual websites ☆298 Updated last year