earwig / mwparserfromhell
A Python parser for MediaWiki wikicode
☆797 Updated last month
Alternatives and similar repositories for mwparserfromhell
Users interested in mwparserfromhell compare it to the libraries listed below.
- A Python library to parse MediaWiki WikiText ☆309 Updated 2 weeks ago
- A Python library that interfaces with the MediaWiki API. This is a mirror from gerrit.wikimedia.org. Do not submit any patches here. See … ☆680 Updated this week
- Python client library to interface with the MediaWiki API ☆328 Updated 2 months ago
- MediaWiki API wrapper in Python http://pymediawiki.readthedocs.io/en/latest/ ☆183 Updated 4 months ago
- Wikidata client library for Python ☆354 Updated 10 months ago
- Streaming WARC/ARC library for fast web archive IO ☆415 Updated 5 months ago
- Heuristic-based boilerplate removal tool ☆780 Updated 3 months ago
- Python library to simplify working with jsonlines and ndjson data ☆293 Updated 9 months ago
- Read and edit a Wikibase instance from the command line ☆231 Updated 2 weeks ago
- Python library for reading and writing WARC files ☆240 Updated 3 years ago
- Python scripts for retrieving CSV data from the Google Ngram Viewer and plotting it in XKCD style. The Python script for retrieving ngram… ☆254 Updated 4 years ago
- A simple interface to the Project Gutenberg corpus. ☆328 Updated 2 years ago
- A Wikidata Python module integrating the MediaWiki API and the Wikidata SPARQL endpoint ☆256 Updated last year
- A set of utilities for processing MediaWiki XML dump data. ☆53 Updated 3 months ago
- Wikipedia tools (for Humans): easily extract data from Wikipedia, Wikidata, and other MediaWikis ☆585 Updated last year
- A modern, interlingual wordnet interface for Python ☆247 Updated this week
- Python package for Wikimedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo… ☆101 Updated last week
- pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance. ☆247 Updated last year
- Fact Extraction from Wikipedia Text ☆535 Updated 9 years ago
- A command-line tool for using the CommonCrawl Index API at http://index.commoncrawl.org/ ☆191 Updated 6 years ago
- Process Common Crawl data with Python and Spark ☆431 Updated last week
- A Python-based HTML to text conversion library, command line client and web service. ☆306 Updated 2 months ago
- Spellchecking library for Python ☆609 Updated 11 months ago
- Entity linking system for Wikidata updated by your edits in real time ☆254 Updated 6 months ago
- GitHub mirror of "wikidata/query/rdf" - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access… ☆149 Updated last week
- Python tools for interacting with Wikidata ☆152 Updated last year
- Filter and format a newline-delimited JSON stream of Wikibase entities ☆97 Updated 7 months ago
- Python stemming library using Snowball stemmers ☆260 Updated last week
- This is a mirror of the script by Giuseppe Attardi, and contains history before the official repo started: https://github.com/attardi/wik… ☆259 Updated 8 years ago
- A Python library for working with and comparing language codes. ☆346 Updated 3 weeks ago