earwig / mwparserfromhellLinks
A Python parser for MediaWiki wikicode
☆827Updated 2 months ago
Alternatives and similar repositories for mwparserfromhell
Users that are interested in mwparserfromhell are comparing it to the libraries listed below
Sorting:
- A Python library to parse MediaWiki WikiText☆313Updated 4 months ago
- A Python library that interfaces with the MediaWiki API. This is a mirror from gerrit.wikimedia.org. Do not submit any patches here. See …☆696Updated this week
- Python client library to interface with the MediaWiki API☆334Updated 2 weeks ago
- MediaWiki API wrapper in python http://pymediawiki.readthedocs.io/en/latest/☆186Updated last week
- Wikidata client library for Python☆359Updated last week
- Wikipedia tools (for Humans): easily extract data from Wikipedia, Wikidata, and other MediaWikis☆591Updated 2 years ago
- Streaming WARC/ARC library for fast web archive IO☆430Updated 9 months ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆107Updated this week
- A simple interface to the Project Gutenberg corpus.☆329Updated 2 years ago
- A Python Wiktionary Parser☆363Updated last month
- English word segmentation, written in pure-Python, and based on a trillion-word corpus.☆376Updated 2 years ago
- Python stemming library using snowball stemmers☆264Updated last month
- Python wrapper for Wikipedia☆694Updated this week
- A Python library for working with and comparing language codes.☆346Updated 4 months ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆154Updated 2 years ago
- Heuristic based boilerplate removal tool☆795Updated 6 months ago
- A modern, interlingual wordnet interface for Python☆259Updated last week
- Hy-phen-ation made easy☆212Updated 6 months ago
- (Official repo for pypi package) Python bindings for the Hunspell spellchecker engine☆186Updated 4 years ago
- a collection of functions that measure the readability of a given body of text☆196Updated 8 years ago
- A set of utilities for processing MediaWiki XML dump data.☆57Updated 7 months ago
- The Open English WordNet☆620Updated this week
- Compute PageRank on >3 billion Wikipedia links on off-the-shelf hardware.☆62Updated 10 months ago
- A Wikidata Python module integrating the MediaWiki API and the Wikidata SPARQL endpoint☆257Updated last year
- A python module for English lemmatization and inflection.☆270Updated 2 years ago
- Tools for parsing and querying Wikimedia Foundation pageview data from both static dumps and the online API.☆66Updated 3 years ago
- Python library for reading and writing warc files☆244Updated 3 years ago
- Various utilities for processing the data.☆211Updated this week
- Universal Dependencies online documentation☆289Updated this week
- A multilingual, cross-domain temporal tagger developed at the Database Systems Research Group at Heidelberg University.☆357Updated 2 years ago