tatuylonen / wikitextprocessorLinks
Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. For data extraction, bulk syntax checking, error detection, and offline formatting.
☆102Updated last month
Alternatives and similar repositories for wikitextprocessor
Users that are interested in wikitextprocessor are comparing it to the libraries listed below
Sorting:
- Sentence aligner☆114Updated 4 years ago
- A list of vocabulary lists☆21Updated 4 years ago
- A modern, interlingual wordnet interface for Python☆250Updated last week
- The Global WordNet Association Collaborative Inter-Lingual Index☆43Updated 7 months ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆163Updated 2 weeks ago
- This packages up data for the Open Multilingual Wordnet☆49Updated 3 weeks ago
- Wiktionary parser tool for many language editions.☆54Updated 2 years ago
- Tools for scraping, annotating, and parsing morphological information from Wiktionary☆14Updated 5 years ago
- Generation of bilingual dictionaries from Wiktionary/dbnary data for the WikDict project☆50Updated 7 months ago
- ☆73Updated 2 months ago
- Deutsches Lyrik Korpus (DLK) / German Poetry Corpus☆18Updated last year
- Master repo for the UniMorph project, includes the UniMorph schema and annotated data files☆30Updated 5 years ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆47Updated 2 years ago
- Wiktionary dump file parser and multilingual data extractor☆940Updated last week
- An advanced, extensible web front-end for the Manatee-open corpus search engine☆66Updated 2 weeks ago
- Python Finite-State Toolkit☆56Updated this week
- The Open Multilingual Wordnet☆61Updated last year
- Machine-readable Wiktionary☆76Updated last year
- A Python library to parse MediaWiki WikiText☆311Updated last month
- German Morphological Analyzer☆47Updated 3 years ago
- An NLP pipeline for Hebrew☆38Updated last week
- Offline bilingual dictionaries made using data from Wiktionary☆55Updated 10 years ago
- Multilingual sentence alignment using sentence embeddings☆120Updated 7 months ago
- Extract data from German Wiktionary XML files.☆26Updated 5 months ago
- A Python Wiktionary Parser☆360Updated 4 months ago
- The Unicode Cookbook for Linguists☆54Updated 4 years ago
- WordNet-LMF formats☆21Updated 2 weeks ago
- Improved Sentence Alignment in Linear Time and Space☆174Updated 2 years ago
- Morphological Dictionaries for German Language☆29Updated 7 years ago
- A simple collocation-driven recognition of rhymes. Contains pre-trained models for Czech, Dutch, English, French, German, Russian, and Sp…☆30Updated 3 years ago