Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. For data extraction, bulk syntax checking, error detection, and offline formatting.
☆112 · Jun 24, 2026 · Updated last week
Alternatives and similar repositories for wikitextprocessor
Users that are interested in wikitextprocessor are comparing it to the libraries listed below.
- Wiktionary dump file parser and multilingual data extractor ☆1,197 · Jun 23, 2026 · Updated last week
- A comprehensive and extensible Wiktionary parsing framework. ☆25 · Sep 5, 2024 · Updated last year
- A Python parser for MediaWiki wikicode ☆880 · Jun 12, 2026 · Updated 3 weeks ago
- A Python library to parse MediaWiki WikiText ☆324 · Jun 25, 2026 · Updated last week
- A Python Wiktionary Parser ☆375 · Jul 23, 2025 · Updated 11 months ago
- ☆25 · Apr 28, 2020 · Updated 6 years ago
- Web front end for WikDict dictionaries ☆20 · Jun 19, 2026 · Updated last week
- Repository for matching Wikidata with riksdagen-corpus ☆14 · Nov 15, 2025 · Updated 7 months ago
- UDAR Does Accented Russian: A finite-state morphological analyzer of Russian that handles stressed wordforms. ☆30 · May 20, 2026 · Updated last month
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more … ☆116 · May 7, 2024 · Updated 2 years ago
- Tool for generating filtered Wikidata RDF exports ☆46 · Apr 9, 2022 · Updated 4 years ago
- Modified version of RusStress (https://github.com/MashaPo/russtress), a Python package for placing stress in Russian text using RNN (BiLST… ☆45 · Aug 7, 2024 · Updated last year
- BabelNet (and WordNet) sense embedding trained with Word2Vec and FastText ☆10 · Sep 3, 2019 · Updated 6 years ago
- Classical CHAT80 NLP system for Prolog ☆25 · Feb 27, 2025 · Updated last year
- Processing the grammar dictionary of A. A. Zaliznyak for morphological inflection ☆19 · Jun 4, 2020 · Updated 6 years ago
- Import workflows for the Wikipedia Citations Database ☆13 · Jun 23, 2026 · Updated last week
- Tools to process OpenAlex raw snapshot files ☆12 · Mar 23, 2026 · Updated 3 months ago
- Python Unicode Block Utilities ☆24 · Oct 23, 2025 · Updated 8 months ago
- Compile wikitext to HTML: wikitext as a templating language. ☆16 · Feb 15, 2026 · Updated 4 months ago
- Phone inventory library ☆17 · May 15, 2023 · Updated 3 years ago
- Feature set algebra for linguistics ☆17 · Jan 19, 2026 · Updated 5 months ago
- Wikipedia Bilingual Reference Data (English) ☆17 · Jun 17, 2016 · Updated 10 years ago
- Scripts for compatibilitising between VISL-CG3, Apertium, CoNLL-X and Universal Dependencies ☆17 · Mar 4, 2020 · Updated 6 years ago
- A library for fetching and reading Tatoeba's weekly exports ☆24 · Feb 5, 2026 · Updated 4 months ago
- GOPHI: an AMR-to-English Verbalizer ☆11 · Feb 5, 2020 · Updated 6 years ago
- Find the origin of words in every language using a Deep Neural Network trained to create an etymological map. ☆22 · May 18, 2018 · Updated 8 years ago
- Automatic Detection of Potentially Idiomatic Expressions ☆12 · Feb 19, 2021 · Updated 5 years ago
- Master's project ☆19 · Sep 11, 2019 · Updated 6 years ago
- ☆17 · Jan 20, 2022 · Updated 4 years ago
- A library to encode text as DNA and decode DNA to text. ☆14 · Nov 21, 2022 · Updated 3 years ago
- A dictionary for Middle Egyptian hieroglyphics. ☆18 · Jan 21, 2026 · Updated 5 months ago
- A social media open post web archiving tool ☆26 · Feb 4, 2026 · Updated 4 months ago
- A sentiment lexicon in classical Chinese poetry domain constructed by deep learning ☆14 · Aug 25, 2022 · Updated 3 years ago
- XED multilingual emotion datasets ☆64 · May 3, 2023 · Updated 3 years ago
- A PHP API for Wikisource. ☆11 · Sep 22, 2024 · Updated last year
- Java client library for integration with Freja eID ☆12 · Feb 27, 2026 · Updated 4 months ago
- CLI tool to record how much time it takes to import each dependency in a Python project ☆11 · Mar 24, 2022 · Updated 4 years ago
- Framework for writing bots, maintenance scripts or performing data analysis on wikis powered by MediaWiki ☆35 · Jan 31, 2026 · Updated 5 months ago
- Python binding for Fontconfig ☆17 · Sep 25, 2025 · Updated 9 months ago