Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. For data extraction, bulk syntax checking, error detection, and offline formatting.
☆108Mar 9, 2026Updated 2 weeks ago
Alternatives and similar repositories for wikitextprocessor
Users that are interested in wikitextprocessor are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Wiktionary dump file parser and multilingual data extractor☆1,122Mar 16, 2026Updated last week
- Code to create a database with cleaned up Wiktionary data and then to create ebook dictionaries based on this data.☆33Aug 16, 2023Updated 2 years ago
- Pronunciation dictionaries for several languages, based on Wiktionary data.☆21Nov 28, 2021Updated 4 years ago
- Analytic tableau based minimal model generator, model checker and theorem prover for first-order logic with modal extensions☆20Aug 22, 2025Updated 7 months ago
- A Python library to parse MediaWiki WikiText☆320May 15, 2025Updated 10 months ago
- A Python parser for MediaWiki wikicode☆866Mar 16, 2026Updated last week
- A Python Wiktionary Parser☆370Jul 23, 2025Updated 8 months ago
- Web front end for WikDict dictionaries☆20Nov 2, 2025Updated 4 months ago
- repository for matching Wikidata with riksdagen-corpus☆14Nov 15, 2025Updated 4 months ago
- An open-source NLP library: fast text cleaning and preprocessing☆23Nov 9, 2021Updated 4 years ago
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more …☆115May 7, 2024Updated last year
- Tool for generating filtered Wikidata RDF exports☆44Apr 9, 2022Updated 3 years ago
- Classical CHAT80 NLP system for Prolog☆25Feb 27, 2025Updated last year
- HFST spell checker library and command line tool☆14Feb 20, 2024Updated 2 years ago
- Datasets for the Monolingual Word Sense Alignment (MWSA) task☆12Nov 10, 2020Updated 5 years ago
- Processing the grammar dictionary of A. A. Zaliznyak for morphological inflection☆19Jun 4, 2020Updated 5 years ago
- Import workflows for the Wikipedia Citations Database☆13Feb 19, 2026Updated last month
- Logical inference system based on event semantics and degree semantics in formal semantics☆11Jan 22, 2023Updated 3 years ago
- import information (affiliation, education) from ORCID database to Wikidata regarding authors of scientific papers☆16May 25, 2023Updated 2 years ago
- RISCV Core written in Calyx☆17Aug 16, 2024Updated last year
- Feature set algebra for linguistics☆17Jan 19, 2026Updated 2 months ago
- Wikipedia Bilingual Reference Data (English)☆17Jun 17, 2016Updated 9 years ago
- Classes and methods for Geometric Deep Learning to support Substack, LinkedIn newsletters and tutorials☆23Mar 14, 2026Updated last week
- Scripts for compatibilitising between VISL-CG3, Apertium, CoNLL-X and Universal Dependencies☆17Mar 4, 2020Updated 6 years ago
- GOPHI: an AMR-to-English Verbalizer☆11Feb 5, 2020Updated 6 years ago
- Large silver standart Russian corpus with NER, morphology and syntax markup☆73Jul 24, 2023Updated 2 years ago
- Extract data from German Wiktionary XML files.☆26Jan 8, 2026Updated 2 months ago
- Easily type Indian languages on macOS !☆16Nov 14, 2023Updated 2 years ago
- Master's project☆19Sep 11, 2019Updated 6 years ago
- A library to encode text as DNA and decode DNA to text.☆13Nov 21, 2022Updated 3 years ago
- A social media open post web archiving tool☆26Feb 4, 2026Updated last month
- A dictionary for Middle Egyptian hieroglyphics.☆17Jan 21, 2026Updated 2 months ago
- IPA Pronunciation Dictionaries in DSL format☆44Jan 13, 2017Updated 9 years ago
- XED multilingual emotion datasets☆64May 3, 2023Updated 2 years ago
- ☆11Nov 17, 2018Updated 7 years ago
- ☆10Jun 11, 2019Updated 6 years ago
- Java client library for integration with Freja eID☆12Feb 27, 2026Updated 3 weeks ago
- CLI tool to record how much time it takes to import each dependency in a Python project☆12Mar 24, 2022Updated 3 years ago
- OpenXTalk Community Don't Panic Edition! Cross-platform development environment (IDE) with a foundation built on xTalk Scripting Language…☆19Jan 29, 2026Updated last month