earwig / mwparserfromhell
A Python parser for MediaWiki wikicode
☆782Updated 2 months ago
Alternatives and similar repositories for mwparserfromhell:
Users that are interested in mwparserfromhell are comparing it to the libraries listed below
- A Python library to parse MediaWiki WikiText☆301Updated 4 months ago
- A Python library that interfaces with the MediaWiki API. This is a mirror from gerrit.wikimedia.org. Do not submit any patches here. See …☆654Updated this week
- Python client library to interface with the MediaWiki API☆325Updated last month
- MediaWiki API wrapper in python http://pymediawiki.readthedocs.io/en/latest/☆181Updated last month
- Wikidata client library for Python☆348Updated 8 months ago
- Heuristic based boilerplate removal tool☆758Updated 2 weeks ago
- Wikipedia tools (for Humans): easily extract data from Wikipedia, Wikidata, and other MediaWikis☆578Updated last year
- Port of Google's language-detection library to Python.☆1,765Updated last week
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆98Updated last week
- Streaming WARC/ARC library for fast web archive IO☆403Updated 3 months ago
- A set of utilities for processing MediaWiki XML dump data.☆51Updated last month
- A python module for English lemmatization and inflection.☆265Updated last year
- A wrapper for a remote SPARQL endpoint☆533Updated 3 months ago
- A Pythonic wrapper for the Wikipedia API☆2,932Updated 10 months ago
- ☆168Updated 9 months ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆150Updated last year
- Fact Extraction from Wikipedia Text☆532Updated 8 years ago
- NLP, before and after spaCy☆2,215Updated last year
- Compact Language Detector 2☆851Updated 3 years ago
- English word segmentation, written in pure-Python, and based on a trillion-word corpus.☆374Updated 2 years ago
- A simple interface to the Project Gutenberg corpus.☆325Updated 2 years ago
- Accurately generate all possible forms of an English word e.g "election" --> "elect", "electoral", "electorate" etc.☆629Updated 3 years ago
- read and edit a Wikibase instance from the command line☆231Updated 2 weeks ago
- A simple Python module for parsing human names into their individual components☆669Updated 9 months ago
- A modern, interlingual wordnet interface for Python☆233Updated last week
- Process Common Crawl data with Python and Spark☆422Updated last month
- A Wikidata Python module integrating the MediaWiki API and the Wikidata SPARQL endpoint☆254Updated last year
- The software used to extract structured data from Wikipedia☆888Updated 3 weeks ago
- pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance.☆245Updated 10 months ago
- Multilingual text (NLP) processing toolkit☆2,327Updated last year