Python package for Wikimedia dump processing (Wiktionary, Wikipedia, etc.). Wikitext parsing, template expansion, Lua module execution. For data extraction, bulk syntax checking, error detection, and offline formatting.
☆107 · Feb 9, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for wikitextprocessor
Users who are interested in wikitextprocessor are comparing it to the libraries listed below.
- Wiktionary dump file parser and multilingual data extractor ☆1,108 · Updated this week
- A Python library to parse MediaWiki WikiText ☆319 · May 15, 2025 · Updated 9 months ago
- Analytic tableau based minimal model generator, model checker and theorem prover for first-order logic with modal extensions ☆20 · Aug 22, 2025 · Updated 6 months ago
- A Python parser for MediaWiki wikicode ☆862 · Jul 1, 2025 · Updated 8 months ago
- Pronunciation dictionaries for several languages, based on Wiktionary data. ☆21 · Nov 28, 2021 · Updated 4 years ago
- Tool for generating filtered Wikidata RDF exports ☆44 · Apr 9, 2022 · Updated 3 years ago
- A Python Wiktionary Parser ☆371 · Jul 23, 2025 · Updated 7 months ago
- ☆11 · Jun 11, 2025 · Updated 8 months ago
- Tools to process OpenAlex raw snapshot files ☆12 · Jan 17, 2025 · Updated last year
- BabelNet (and WordNet) sense embedding trained with Word2Vec and FastText ☆10 · Sep 3, 2019 · Updated 6 years ago
- Import workflows for the Wikipedia Citations Database ☆14 · Feb 19, 2026 · Updated last week
- Datasets for the Monolingual Word Sense Alignment (MWSA) task ☆12 · Nov 10, 2020 · Updated 5 years ago
- Extract data from German Wiktionary XML files. ☆26 · Jan 8, 2026 · Updated last month
- UDAR Does Accented Russian: A finite-state morphological analyzer of Russian that handles stressed wordforms. ☆29 · May 14, 2025 · Updated 9 months ago
- Inflecting Finnish words (verb inflection, comparatives, cases, possessive suffixes, clitics) using Wiktionary-compatible declensions and… ☆34 · Sep 6, 2020 · Updated 5 years ago
- ☆16 · Jan 20, 2022 · Updated 4 years ago
- import information (affiliation, education) from ORCID database to Wikidata regarding authors of scientific papers ☆16 · May 25, 2023 · Updated 2 years ago
- A simple n-gram language model. ☆12 · Sep 11, 2018 · Updated 7 years ago
- Compile wikitext to HTML: wikitext as a templating language. ☆16 · Feb 15, 2026 · Updated 2 weeks ago
- Wikipedia Bilingual Reference Data (English) ☆17 · Jun 17, 2016 · Updated 9 years ago
- Scripts for compatibilitising between VISL-CG3, Apertium, CoNLL-X and Universal Dependencies ☆17 · Mar 4, 2020 · Updated 5 years ago
- A collection of open source tools and resources related to Wikibase knowledge graphs ☆75 · Sep 9, 2025 · Updated 5 months ago
- Gramadán: a computational grammar of Irish ☆17 · Jan 23, 2023 · Updated 3 years ago
- DAFSA-based dictionary-like read-only objects for Python. Based on `dawgdic` C++ library. Fork of https://github.com/pytries/DAWG ☆16 · Jan 1, 2026 · Updated 2 months ago
- A dictionary for Middle Egyptian hieroglyphics. ☆17 · Jan 21, 2026 · Updated last month
- PPSR Core standards repository ☆25 · Feb 17, 2026 · Updated last week
- A knowledge integration framework based on Wikidata ☆23 · Nov 6, 2025 · Updated 3 months ago
- Imports Wiktionary's grammatical data into Wikidata ☆18 · Jan 11, 2020 · Updated 6 years ago
- Scripts for Wikidata ☆21 · Jan 10, 2026 · Updated last month
- Python bot framework for Lexemes on Wikidata ☆19 · Feb 6, 2021 · Updated 5 years ago
- Parses Wikipedia citation templates in Python ☆17 · Mar 26, 2025 · Updated 11 months ago
- Curated list of Wikidata Projects ☆23 · Dec 7, 2025 · Updated 2 months ago
- The source of the phonetic transcriptions is Oxford Advanced Learner's Dictionary (3rd ed.), available from the Oxford Text Archive (http… ☆26 · Apr 25, 2017 · Updated 8 years ago
- Own pywikibot scripts (for Wikimedia projects) ☆21 · Nov 30, 2025 · Updated 3 months ago
- Unexport is a linter that tries to keep the `__all__` in your Python modules always up to date. ☆23 · Dec 20, 2022 · Updated 3 years ago
- CMU dictionary in IPA instead of their subset of Arpabet ☆16 · Sep 24, 2024 · Updated last year
- Machine-readable Wiktionary ☆78 · May 6, 2024 · Updated last year
- A library for fetching and reading Tatoeba's weekly exports ☆24 · Feb 5, 2026 · Updated 3 weeks ago
- Java client library for integration with Freja eID ☆12 · Updated this week