earwig / mwparserfromhell
A Python parser for MediaWiki wikicode
☆859 Updated 7 months ago
Alternatives and similar repositories for mwparserfromhell
Users who are interested in mwparserfromhell are comparing it to the libraries listed below.
- A Python library to parse MediaWiki WikiText ☆316 Updated 8 months ago
- Python client library to interface with the MediaWiki API ☆340 Updated last month
- A Python library that interfaces with the MediaWiki API. This is a mirror from gerrit.wikimedia.org. Do not submit any patches here. See … ☆728 Updated this week
- Wikidata client library for Python ☆364 Updated 3 months ago
- MediaWiki API wrapper in Python http://pymediawiki.readthedocs.io/en/latest/ ☆186 Updated 3 weeks ago
- Wikipedia tools (for Humans): easily extract data from Wikipedia, Wikidata, and other MediaWikis ☆592 Updated 2 years ago
- Streaming WARC/ARC library for fast web archive IO ☆445 Updated last year
- A Python Wiktionary Parser ☆371 Updated 6 months ago
- A Python library for working with and comparing language codes. ☆353 Updated 9 months ago
- Python wrapper for Wikipedia ☆714 Updated last week
- English word segmentation, written in pure-Python, and based on a trillion-word corpus. ☆378 Updated 3 years ago
- A modern, interlingual wordnet interface for Python ☆280 Updated this week
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo… ☆107 Updated 2 months ago
- Heuristic based boilerplate removal tool ☆811 Updated 11 months ago
- A tokenizer and sentence splitter for German and English web and social media texts. ☆151 Updated last year
- Python3 bindings for the Compact Language Detector v3 (CLD3) ☆155 Updated 2 years ago
- A simple interface to the Project Gutenberg corpus. ☆331 Updated 3 years ago
- Python stemming library using snowball stemmers ☆275 Updated last month
- ☆178 Updated 10 months ago
- Collaborative data curation for Glottolog ☆184 Updated this week
- Text tokenization and sentence segmentation (segtok v2) ☆208 Updated 3 years ago
- Crawler for linguistic corpora ☆213 Updated 5 months ago
- Accurately generate all possible forms of an English word, e.g. "election" --> "elect", "electoral", "electorate" etc. ☆632 Updated 4 years ago
- The Open English WordNet ☆715 Updated 3 weeks ago
- A Python module for English lemmatization and inflection. ☆274 Updated 2 years ago
- Correctly generate plurals, ordinals, indefinite articles; convert numbers to words ☆1,060 Updated 8 months ago
- Python tools for interacting with Wikidata ☆161 Updated 2 years ago
- Various utilities for processing the data. ☆217 Updated last week
- A set of utilities for processing MediaWiki XML dump data. ☆61 Updated 11 months ago
- Spellchecking library for Python ☆618 Updated 4 months ago