tatuylonen / wikitextprocessorLinks
Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. For data extraction, bulk syntax checking, error detection, and offline formatting.
☆107Updated last month
Alternatives and similar repositories for wikitextprocessor
Users that are interested in wikitextprocessor are comparing it to the libraries listed below
Sorting:
- A modern, interlingual wordnet interface for Python☆278Updated this week
- This packages up data for the Open Multilingual Wordnet☆59Updated last week
- A Python Wiktionary Parser☆369Updated 5 months ago
- The Global WordNet Association Collaborative Inter-Lingual Index☆50Updated last year
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆182Updated 7 months ago
- A Python library to parse MediaWiki WikiText☆315Updated 8 months ago
- The Open Multilingual Wordnet☆66Updated last year
- Machine-readable Wiktionary☆77Updated last year
- Master repo for the UniMorph project, includes the UniMorph schema and annotated data files☆33Updated 6 years ago
- Wiktionary parser tool for many language editions.☆54Updated 3 years ago
- Python Finite-State Toolkit☆60Updated 3 weeks ago
- A list of vocabulary lists☆22Updated 5 years ago
- Tools for scraping, annotating, and parsing morphological information from Wiktionary☆15Updated 6 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆57Updated 4 years ago
- A python module for English lemmatization and inflection.☆274Updated 2 years ago
- Collaborative data curation for Glottolog☆182Updated 2 weeks ago
- A cloud-based, open-source system for writing and publishing dictionaries.☆98Updated 2 years ago
- The Open English WordNet☆701Updated last week
- Morphological Dictionaries for German Language☆30Updated 7 years ago
- ☆81Updated this week
- Gather modern English word frequencies from all enwiki articles.☆227Updated last year
- The World Atlas of Language Structures☆72Updated last year
- University of Colorado VerbNet☆119Updated last year
- Python tools for interacting with Wikidata☆160Updated 2 years ago
- A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning tech…☆75Updated 4 months ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆52Updated 2 years ago
- A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning tech…☆74Updated last year
- WordNet-LMF formats☆24Updated 2 months ago
- An advanced, extensible web front-end for the Manatee-open corpus search engine☆78Updated this week
- Wiktionary dump file parser and multilingual data extractor☆1,076Updated last week