tatuylonen / wikitextprocessorLinks
Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. For data extraction, bulk syntax checking, error detection, and offline formatting.
☆101Updated 2 weeks ago
Alternatives and similar repositories for wikitextprocessor
Users that are interested in wikitextprocessor are comparing it to the libraries listed below
Sorting:
- A modern, interlingual wordnet interface for Python☆247Updated this week
- The Global WordNet Association Collaborative Inter-Lingual Index☆42Updated 6 months ago
- This packages up data for the Open Multilingual Wordnet☆49Updated last week
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆160Updated 2 weeks ago
- Generation of bilingual dictionaries from Wiktionary/dbnary data for the WikDict project☆49Updated 7 months ago
- Wiktionary dump file parser and multilingual data extractor☆923Updated this week
- Morphological Dictionaries for German Language☆29Updated 7 years ago
- A Python Wiktionary Parser☆360Updated 3 months ago
- A list of vocabulary lists☆21Updated 4 years ago
- Master repo for the UniMorph project, includes the UniMorph schema and annotated data files☆30Updated 5 years ago
- An advanced, extensible web front-end for the Manatee-open corpus search engine☆64Updated this week
- Machine-readable Wiktionary☆76Updated last year
- WordNet-LMF formats☆21Updated last week
- The Open Multilingual Wordnet☆61Updated last year
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆47Updated 2 years ago
- Wiktionary parser tool for many language editions.☆54Updated 2 years ago
- Tools for scraping, annotating, and parsing morphological information from Wiktionary☆14Updated 5 years ago
- A cloud-based, open-source system for writing and publishing dictionaries.☆91Updated last year
- Aksharamukha Python Library☆48Updated 4 months ago
- Stand-alone WordNet API☆48Updated 3 years ago
- Lexical data at Unicode☆68Updated 9 months ago
- A versioned python wrapper package for cmudict (https://github.com/cmusphinx/cmudict).☆63Updated last month
- A Python library to parse MediaWiki WikiText☆309Updated 2 weeks ago
- Lexical database for ~70k English words with morphological variables☆44Updated 3 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆51Updated 3 years ago
- Offline bilingual dictionaries made using data from Wiktionary☆55Updated 10 years ago
- An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Span…☆80Updated 2 weeks ago
- Sentence aligner☆113Updated 4 years ago
- The Open English WordNet☆558Updated last week
- Helsinki Finite-State Technology (library and application suite)☆130Updated last week